BELLMAN FUNCTION TECHNIQUE IN HARMONIC ANALYSIS 
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Abstract. It is strange but fruitful to think about the functions as random pro- 
cesses. Any function can be viewed as a martingale (in many different ways) 
with discrete time. But it can be useful to have continuous time too. Processes 
can emulate functions, expectation of profit functional on the solution of stochas- 
tic differential equation can emulate the functional on usual familiar functions. 
The advantage is that now we have "all" admissible functions "enumerated" as 
solutions of one stochastic differential equation, and choosing the best function 
optimizing a given functional becomes a problem of choosing the right control 
process. But such problem has been long since solved in the part of mathematics 
called Stochastic Optimal Control. So-called Bellman equation reduces an infi- 
nite dimensional problem of choosing the best control to a finite dimensional (but 
non-linear as a rule) PDE called Bellman equation. Its solution, called Bellman 
function of a given optimization problem, gives us a lot of information about op- 
timum and optimizers. This method gave some interesting results in the classical 
Harmonic Analysis, having on the surface nothing to do with probability. Some- 
times the results obtained by this method did not find "classical" proofs so far. It 
is especially well-suited to estimates of singular integrals, probably because of the 
underlying probabilistic structure of classical singular integrals. 

1. QUASICONFORMAL MAPS: SHARP DISTORTION ESTIMATES AND SHARP 

REGULARITY 

We deal first with Beltrami equation 

(1) f-z-ftf. = 0, 

with bounded function /i called Beltrami coefficient, for simplicity fi is compactly 
supported on C, / being analytic near oo (see ([T])) supposed to have the following 
Laurent decomposition at infinity 

(2) f{z) = z + co + — + ... . 

z 

If fi is smooth it is not difficult to see that the solution is smooth on the whole 
C. But we are interested in just measurable bounded /i: 



(3) ||/i||i^(c) = k <1. 

Several natural questions appear: 

1. What is the smoothness of / depending on yU, A;? 

2. What are distortion properties of /? How it distorts the area and other 
measures? 

3. In what classes (Sobolev, say) we can solve ([1]) in such a way that it will be 
continuous C — )■ C, where C = C U oo? As at infinity it is a perfect holomorphic 
map, this question concerns only finite part of C, so it is local. 
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Denote g := fz- It is a function with compact support. We can restore /: 

(4) fiz) = - [ -^giO dm2{C) + CO + Z. 

We used ([2]) and we naturally assume integrability of g. 
Then obviously 

where the integral is singular and should be understood , e.g. in the sense of principal 
values. This is an important operator called the Ahlfors-Beurling transform (AB 
transform) : 

Then ([1]) automatically becomes 



(5) g - fiTg = {I - fiT)g = h, 

where h = ^ is bounded with compact support. So in particular h G np>iLP(C). 

It is easy to make Fourier analysis of convolution kernel ^ of AB operator, and to 
see that it is the Fourier multiplier with symbol C/C- Therefore, ||T||2,2(c) = 1 and 
having then ||/iT||2,2(c) < < 1, we can conclude that ([5]) has a solution in L^(C) 
given by the usual Neumann series: 

(6) g = h + fiTh + fiTfiTh + . . . , . 

Notice that g is compactly supported (as fi is). Restore f hy @. The boundedness 
of T in implies now that 

(7) / e WlJC) . 

Let us see now that g given in (|6]) is actually better than in L^. Operator T has 
norm 1 in and it has norm close to 1 in L^, p > 2,p ^ 2. In fact, it is an operator 
with Calderon-Zygmund kernel, and as such it is bounded in all L^. Interpolating 
between, say and L^, we get 

(8) ||T|U.(c)=:n(p)->l,p^2. 

So we can find such ap = p{k) = 2+e{k),e{k) > 0, that the series in ([6]) converges 
in this L'^^'^^^\ So g G L'^^'^'^'^\ Again restore / by formula (jl]) (it is the same / of 
course), again use that fz = T fz + 1 = Tg + 1, and that T is bounded in all W 
being a Calderon-Zygmund operator. We got that / self-improves from ([7]) to 



(9) f^WlJC), p = 2 + eik),e{k)>0. 

We formulate this small fact as a fundamental Ahlfors-Bers-Bojarski's theorem: 

Theorem 1.1. Any solution of ([T]) in W^i^^ self-improves to being in W'^'l^}''\ 
e{k) > 0. In particular, any such solution is continuous on C and even Holder 
continuous. There exists a solution, which is a homeomorphism of C into itself. 
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New questions appear: 

4. What is the largest 2 + e{k)? 

5. What is n{p) in (JED? 

To this we want to add some more questions. Introduce a constant 

It has a geometric meaning: it gives the maximal ratio of the axis of infinitesimal 
ellipses obtained as the images of infinitesimal circles by all possible solutions of ([T]). 
Definition. Any solution of ([1]) from Theorem 11.11 is called K-quasiregular map. 
Any homeomorphic solution is called K-quasiconformal map or K-quasiconformal 
homeomorphism. It is basically unique because by normalization at infinity it can 
be only shifted. 
Again questions: 

6. What is the sharp distortion of K-quasiconformal maps? Namely, if /(ID) = D, 
where © denotes the unit disc and /(O) = 0, then what is the sharp (largest) 
exponent in 

(10) yE C^, \f{E)\<CK\Ef^^l 

Without normalizations, allowing / to be any K-quasiconformal map this becomes 
the question what is the best (largest) exponent in 

Function 

fo{z) := zlzl^^-^ , \z\ < 1, and = 2; , for \z\ > 1 
shows that e{K) < ^. 

Gehring's problem: e{K) = It is equivalent to saying (we will se that) in 
Question 4 the sharp exponent of Sobolev integrability is 2 -|- e{k) = 1 + ^— . This 
is very tough, but it was done by Astala [T]. 

Glance now at (jH]) : it gives that if we want to show that the exponent p of Sobolev 
integrability goes up to 1 + ^ — , it is enough to prove that 

||T||^i+i/fc = 1/k, 

in other words that 

(12) \\T\\l,=p~1,p>2. 

This is very open, we will show how Bellman function gives partial results. 

Big Iwaniec's problem or p — 1-problem: n{p) = max(p — 1, — 1). 

Yet another question naturally arises: in Theorem 1 1.1 1 we started with a priori solu- 
tion in Wfi^^. How much helow this we can start to have the same self-improvement? 

7. Let / G W^f Joe solves ([1]), and q G (1, 2). What is the smallest q = q{k) such that 
we still have for each such / self-improvement to W^i^J (And then automatically 
to H^i^X^*^^ by Theorem ini and then up to Wl^^J'"' by Astala's [1]?) 

Iwaniec's problem: q{K) = 1 + k. 
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We will prove it here using Bellman function technique and Astala's sharp distor- 
tion result [1]. We follow the exposition of [2]. 

1.1. Invertibility of Beltrami operator. If Big Iwaniec's problem were solved 
than we would immediately get 

(13) If J9G [2,1 + 1/A;), then ||(J - /iT)-^^^ < . 

(Actually even with C{k) = 1/k, but this we do not care about as k is fixed and we 
vary p.) 

By duality and small talk one would get 

(14) li pelk:= {I + k,l + 1/k), then \\{I-fiT)-^\\Lv< ^^^^ 



dist{p, R \ Ik) 

Big Iwaniec's conjecture is still a conjecture, but this is a Theorem of Petermichl- 
Volberg [SS], which we start to prove now. It will use Bellman function technique. 
Notice that now there exists an even more precise version of this result, namely, see 

Theorem 1.2. If p e /, := (1 + fc, 1 + 1/k), then \\{I - ^,T)-'\\lv < • 

Let / be a K-quasiconformal homeomorphism, let p G [2, 1 + 1/A;), where k = 
Denote Jf = l/^p — the Jacobian of /. We need lemma: 

Lemma 1.3. Let f, p be as above, denote w := Jj Then w G A2 with 

p'CjK) 

[WU2 < T— 1 • 

Remark. We will give the proof following [2]. There is another very interesting 
proof in [3j . 

Proof. Notice first that 

(15) (1 - k')\f.\' <jf = - i/.-r < i/.p < i/.p + \h? . 

That is all this quantities are comparable with C = C{K). The next step is to 
show that there is C (K) such that if i? C C is a disc and if / is a K-quasiconformal 
homeomorphism of C, then 

1 /•„., pc(j<)_njvm\ 



(16) ^y;iM+iM)'s^^._^^ ^ 

Using linear maps to pre-compose and to post-compose with / we reduce it to 
normalized case |/(-B)| = \B\ = 1. Apply (ITT]) proved by Astala in [1] to the set 

Et = {zeB:\f,\^ + \f,\'>t},t>0 



we get 



\Et\ <-! (I/.P + \f,?)dm, <K- f (|/,p - \f,?)dm, 

^ JEt ^ JEt 



K^\f{E,)\<C,{K)^\E,\^. 
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Therefore, 

\Et\ <min(l,C2(i^) ^ ' 



K 



This is the same as 

1 



\{zeB: \M + \f,\ >t}< min(l, C,{K)- 



Distribution function calculation now shows 



(17) 

p poo xp~l roo -1 -| 

/ m+\Mr < c'+c"p / = c'+c"p / —^dt < c'+c"p-—, — 

Jb Ji t~ Ji t^^k-P l + -j:-p 

as -^3T = 1 + i This proves ffTB]) . 



Now we are ready to prove Lemma [L3l Notice that w = Jj = {Jf-^ ° fY^"^ 
Then 

j^Jwdm, = jL-^ ^( J/- o fr/'-\z) dm,{z) = ^ ^^^^ '^'A^^^J0§) ^"^2^^) ' 
where we made the change of variable z = /^^(C)- We continue 
1 / , 1 / ,p/2,„ , ,,,, pC(K) ( \B\ \"'\!(B)\ 

So 

p/2-l 



|5| JI, - . 

We used here f|T6|) with K-quasidisc /(-B) instead of a disc -B. But this does not 
matter as any K — quasidisc (:= the image of a disc by K-quasiconformal map) is 
an almost disc with constants depending only on K. 

Now notice that we assumed p > 2, so if p„ := p — 2 we can write 

^ ^ dm2 = ^ lyfY^'-'iz) dm^{z) = ^ ^ JT'^i^) dm^iz) , 

and we use again (fT6|) with p„ replacing p, gives us: 
(19) 



J_ f -1, ^ Png(i^) / |/(i?)i y"^^_ max(C-,p-2)C(A0 / |/(i?)| Y 



Multiplying ( |T8l) and ( |T9l) we get Lemma 11.31 
(20) w = jy\pe [2, 1 + h ^ Ma. < ^'^^^^ 



□ 



A;^ 1 + i-p 

Now we can reap a first consequence: 

Theorem 1.4. Suppose ||/i||oo = /c < 1, /ei p G /fc := (1 + fc, 1 + ^). T/ien operators 
I — fiT, I — Tfi are boundedly invertible in W . 



6 ALEXANDER VOLBERG 

Proof. We can work with / — fiT as / — Tfi = T{I — fiT)T~^ and T is boundedly 
invertible in each L"^, 1 < g < oo (T^^ is again a Fourier multipher of Calderon- 
Zygmund type). 

Suppose we know how to prove the estimate from below 

(21) II (/ - fiT)g\\p > c{p, k)\\g\\p,Wg e ^^(C), p e h . 

Then we would now exactly the same for I — /iT, and, so, for the adjoint operator 
(/ — ^T)*. Then / — /iT would have dense images in all we consider. Joining 
this with the estimate from below (!2T]) we would conclude that / — /iT are invertible 
in in all LP , p G Ik- 

So it is enough to have ( l2Ti) . And we would like a good estimate for c{K,p) in it. 
It is enough to prove fl2Tl) for the dense set of functions 



geC^iC), [ gdm2 = 0. 
Jc 



Let be the Cauchy transform oi g: (f) = ^ J c?^2(C)- 
Denoting h := g — fiTg we come to equation 

02 — ^(pz = h ,m which we want to estimate ||0z||p < C{K,p)\\h\\p ii p E Ik ■ 

By Theorem 11.11 there is a i^-qc homeomorphism / satisfying /j — fif^ = 0. Set 

and let us see how equation 0^ — /i02 = h will be transformed by this change of 
variable. 

We calculate 

0s = (Uz O f)fz + (Ug O f)fz 
(j)z = (Uz O f)fz + (Uz O f)fz 
(l)z - fJ'(l)z - h = {Uz O f)fz + {Uz O f)fz - fi{{Uz O f)fz + {Uz O f)fz) - k 
(Uz O f)fz - KUz O f)fifz -h = {1- \fi\'^)iUz o f)fz-h = 0. 

Hence obviously 

I \u,of)\P\f^\P<CiK) J \h\P^ J \u-zof)Yjf-^Jf<C{K) j \h\ 
And changing variable we get 



' 2 ; 
f-z, 



(22) j \u,Y\{Jf-.Y^^/' = j \u,n{Jf o /-i)p/2-i < c{K) j \hY 

On the other hand. 

Denote by W := {Jj^if-^'^. It is the one in Lemma 11.31 only / replaced by / ^ , 
which is a i^-qc homeomorphism as well. 

But the last expression above can be written as 

j \uznjf-.Y-^/^ = j \T{u,)\^{Jf-.f-^l' = j \T{u,)\m =:TU 
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(in fact, u can be restored from Ug by Cauchy integral with no addition because u 
vanishes at infinity; then Uz = T{uz)- Notice that (l22l) says that 

(23) j \u,\^W <C{K)\hY. 

Combine fl23|) with (here F is some "unknown" function on [l,oo), but finite for 
all finite arguments) 

TU = j \T{u,)\m = F{[w]a,) j \u,YW 

to get 

(24) m\v<C{ j + j \T{u,)\PW)<C{K){l + F{\wUMh\\l- 

Noticing that Lemma fL3] gives the estimate [w]a2 < "we conclude finally 

that 

II^^IIp < C{K)F{ f^[^^ ) \\h\\, , if p 6 [2, 1 + i) . 

We need the same estimate now for 1 + k < p < 2. We need W E Ap (now 
p < 2. But it is the same as to say that g Apt^p' = p/{p ~ 1). In 

our case W := (J/-i)^~^/^, so will be {Jf-i)~~^, which is inverse to the 

one in Lemma ll.3[ so also in A2 C Api. We get for the whole interval of p's: 
p e 4 = (1 + /c, 1 + i) also 

(25) ||^||p<F(max( ^^, ^ ^ . \ / ^ \)\\h\\p = F{ ^ ^ ^ 



Theorem 11.11 is proved. 



□ 



In [2] the following conjecture was formulated that claims that function F in 
(I25II is just linear. Notice that this would in fact easily follow from Big Iwaniec' 
conjecture. 

Conjecture 

(26) \\g\\p < ^'^^^^^ J ||/^||,,equivalently IKJ-^r)"!, < ^'^^^^ ^ 



dist{p,R\Ik)'" ' - dist{p,R\ Ik) 

We will prove now this conjecture using the Bellman function technique. But first 
let us derive the corollary of the conjecture. As always ||/i||oo = k < 1. 

Theorem 1.5 (Corollary of the conjecture). Any solution of 

F,-fiFz = 0, 

which is in VT^^c ^■^ automatically in W^i^^, and so satisfies Theorem ] 1.1[ It auto- 

matically self-improves then (by Astala's ^Jj to he in W^iJ^^ . 

First use Conjecture to prove 

Lemma 1.6 (Behavior at the end points of interval Ik)- Operators I — fiT, I — Tfi 
have dense range in L^^k and, correspondingly, trivial kernels on L^^^ . 



8 



ALEXANDER VOLBERG 



Proof. By T{I ~ fiT)T ^ = I — Tfi and invertibility of T in all spaces L^, 1 < p < oo, 
it is enough to prove just the dense range of / — fiT in L^^k. Consider e > and 
equation 

(27) 0. - (1 - e)^T(t), = h 

for nice h G . We want to consider the solution for = 1 + ^- We consider this 
Po in I{i-e)k because ||(1 — £:)/i||oo = (1 ~ Point po is obviously C{K)e close to 
the right end point of 

Hence, applying conjecture we conclude that 

II0.IU<^II^IU- 

Notice two things: 1) In the norma of 0e are uniformly bounded by C{K) just 
by using Neumann series in LF' in ( 127|) : 2) in the norms of T{e(j)^) are uniformly 
bounded. It is immediate to conclude from 1) and 2) that 

efiT(j)^ converges weakly to zero in . 

Rewrite our equation (127|) as follows: 

0e - nT(f)e = h- e^T(j)e . 

The right hand side weakly in L'^*^ converges to any function h, whose family is 
strongly dense in L^°. So the right hand side is weakly dense in L^" = L^~^k. But it 
is in Range{I — fJ^T), so this range is weakly dense in L'^'^k. Being a linear set this 
range is then strongly dense in L^+fc. Lemma [1.61 is proved 

□ 

The proof of TheoremlLM Consider - fiR, = 0,R e W^l^^. Choose (p G C^. 
Set G = <pR. Then 

- = (</)g - fi(f)z)R- 

Looking at this formula we can start to think that the support of fi is compact (is 
contained in the support of (p). 

As G vanishes at infinity it is the Cauchy transform of its dG = Gg, and therefore, 
Gz = TGz- We can rewrite the equation 

(/ - ^iT)%l) = h- ^ := , h = {(f), - n(j),)R G C ^^(C) H ^^^"(C) . 

The inclusion above for function R is by Sobolev imbedding, in fact, we assumed 
that R G W^i^/^c' ^^"^ t)y the compactness of the support of R. It has been already 
remarked, that in the last two equation we have the right to think that n = 
outside of the support of (p. Let us consider the convergent in L^(C) of the series of 
compactly supported functions: 

ipo = h + fiTh + fiTfiTh + . . . . 

It solves our equation, it is in -^^^(C) and it is compactly supported, hence it is in 
L^+*''(C). Therefore we got a solution ipo of (/ — fiT)ipQ = h, which is in fl L^. 
But ip = Gz is also in L^"*'^. By Lemma [L6I we have ip = ipo ^ L^- It means that 
R G W^i^^. Theorem 11.51 is proved. 

□ 
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j_ 1 

Example. / = for z E B, f = ^ outside D. It is a solution of Beltrami 

equation with /i, ||/i||oo = k = and it is in W^i^^ for every q < 1 + k. But it is 
NOT K- quasiregular mapping, it has a singularity at 0. 



This example shows how sharp is Theorem 11.51 Its proof hinges on Conjecture 
We prove this Conjecture now using Bellman technique. First we analyze function 
F from Recall that W := {Jf-iy-P/"^, and if p G (1 + A;, 1 + i) the estimate in 
(125|) is Fp([l^]^2), where Fp is the best function one can have in the estimate 

\\T\\lhw) < Fp{[W]A,) . 

Suppose we can prove 
Theorem 1.7. F{x) < Ca;'"''^^^'^/^^-^)) . 

Then we recall that for 1 + k < p < 2, p' := we already estimated [M/^lY^^"^"* = 

[iy"V(P-i)]^^, < < And for 1 + i > p > 2, [W^ <'^p- 

These estimates were based on sharp distortion theorem of Astala [1]. We made 
these estimates in Lemma [L3l These estimates and Theorem 1 1 . 71 1 hen imply trivially 
conjecture (126|) . Hence this Theorem 11.71 is the only ingredient left to be proved to 
have Theorem 11.51 

2. Linear estimates of weighted Ahlfors-Beurling transform by 

Bellman function technique 

Let u be any weight on M^, denote its heat extension into M.^ by u){x,t) = 
uj{xi,X2,t): 



1 f f \\x — y II ^ 
uj{x,t) = — / uj{y)exp{ )dyidy2 



We define 



[ij]X^' := sup u{x,t){u p-^{x,t) 

{x,t 



1 



p-1 



The weights w with finite [w]^^"* are called Ap weights. There is an extensive 
theory of Ap weights, see for example [SS],|1D]. The usual definition differs from the 
one above, but it describes the same class of weights. Actually, we will say more 
about the relationship between the classical definition and ours. But first we state 
two more theorems, whose combined use gives Theorem 11.71 at least for p >2. 

Remark. The method called Rubio de Francia extrapolation-one can see its ex- 
position in [27]-actually shows that to have a full range of p's in Theorem 11.71 it is 
enough to prove it only for p = 2. 

Theorem 2.1. For any Ap weight w and any p > 2 we have 

\\T\\LP(u,dA)-,LP{u,dA) < C(p)(H^'^^"*)^ . 

We want to discuss the connection between and [w]^"**''. Here [wj^l"**"* 

denotes the following supremum over all discs in the plane: 

[w]^"** ■= sup ( ^ [ udA\ ■ ( ^ [ u~p^dA 
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Obviously, there exists a positive absolute constant a such that for any function 



w 



Remark. The opposite inequality is easy to prove too. In fact, we have 
Theorem 2.2. There exists a finite absolute constant b such that 

Proof. Constants will be denoted by the letters c, C; they may vary from line to line 
and even within the same line. We introduce the following notations. denotes 
5(0,2^), k = 0,1,2,..., {f)B stands for the average Jg fdA, f{B) stands for 

fdA. \iB = S(0, r), then (/)^ stands for ^ //j,, f{x) exp(-M!)t/xirfx2. 

Lemma 2.3. Suppose f and g, positive functions on the plane, are such that 
supfi if) 3(9)3 = A, then there exists a finite absolute constant c such that 

{f)3{9)% < cA 

for any disc B. 

Proof. Scale invariance allows us to prove this only for one disc B = -8(0, 1). We 
start the estimate: 

{f)3{g)'3<c{f)3^k2''eM-2"'-')j^- 

On the other hand > c{f)B^_^ > ■ ■ ■c^{f)B (recall that B is the unit disc). 

Plugging this in the inequality above, we get 

{f)3{g)%<c{f)B^kC''eM-'2''''' ^ 



'{f)B- 

In other words, 

< cASfeC"=exp(-22'=-2) = cA 
and the lemma is proved. □ 

Now we want to prove Theorem l2.2[ Fix B. Again by scale invariance it is enough 
to consider B = B{0, 1). By the previous lemma, we know that 

(28) (/)b,(^7)1<cA 

for any k. 
Now 

rA 

{f)'h{9)3 < c(^7)|S22'=exp(-22'=-2)(/)^, < c{g)'B^2''' exp{-2'''~'' 



' i9A ■ 

The last inequality used f l28|) . 

On the other hand, {g)B^. > > • • • ^^{9)3 (recall that B is the unit disc). 

Plugging this in the inequality above, we get 

rA 

imgYh < c{g)%J:,C'^exp{-2"''')-^. 

\9l 3 

In other words. 
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{1)^9)3 < c'AJ:,C'expi-2''-') = c'A. 
Theorem 12.21 is completely proved. 



□ 



The next result proves Theorem 11.71 for p = 2. We will show later how to extrap- 
olate just from the result at j9 = 2 to all possible p's. 

Theorem 2.4. For any A2 weight w we have 

\\T\\L2{wdA)-^L'^{wdA) < C [wY^'^"'' . 

Proof. There will be many steps. But we are going to prove only Theorem 12.11 and 
only for p = 2. By Theorem 12.21 and Rubio de Francia extrapolation this is enough. 

c - e. 



The operator T is given in the Fourier domain (^1, ^2) by the multiplier 7 = iatt 



= 7^ - 7^ + 2^#S^- Thus, T can be written as T = i?^ _ ^ 2iRiR2, 
where Ri,R2 are Riesz transforms on the plane (see [55] for their definition and 
properties). Another way of writing T is 

T = mi + im2 , 

where mi,m2 are Fourier multiplier operators. Notice that the multipliers them- 
selves (as functions, not as multiplier operators) are connected by 

m2 = mi o p , 

where p is 7r/4 rotation of the plane. So the multiplier operators are related by 

m2 = UpTTLi , 

where Up is an operator of p-rotation in {xi,X2) plane. But for any operator K we 
have 

\\UpKU~^\\L-2{wdA)^L'^(wdA) = \\K\\L2{wop-^dA)-^L'2{wop-idA) ■ 

Combining this with the fact that Q^^2* — Q^op-^ 2 rotation, we conclude 

that we only need the desired estimate of Theorem 12.41 for mi = R\ — R\. Actually, 
we will show that 

(29) \\Rl\\L^{wdA)^L'^(wdA) < CQ^^^, i = 1,2. 

To prove fl2.6p we fix, say, R\ and two test functions v^, ^ . We will be using 
heat extensions. For / on the plane, its heat extension is given by the formula 

f{y,t):=— [[ /(x)exp(-^^^)rfxirfx2, (l/,t)eM^. 
'n't JJm.2 t 

We usually use the same letter to denote a function and its heat extension. 

Lemma 2.5. Let if,ip& . Then the integral JJJ ^ ■ ^ dxidx2dt converges 
absolutely and 

dip dip 



(30) j j Rl^ ■ tp dxidx2 = -2 




dxidx2dt . 
0x1 0x1 
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Proof. The proof of this lemma is actually trivial. It is based on the well-known fact 
that a function is an integral of its derivative, and also involves Parseval's formula. 
Consider and now 




oo 

2_i_c2 



e-2*(«+«^^)eiV(ei, 6)^(^1, i2)di,di2dt 







J J ^ei^(ei,6)e-*«'+«^') ■ tm-^i, -i2)e-'^^"'-^"-^di,di2dt 
-2 I 1 1 ■^—{xi,X2,t)-^{xi,X2,t)dxidx2dt = 



Jo . 

-2 /// -K—{xi,X2,t) — {xi,X2,t)dxidx2dt. 

O O O 1 1 

Above we used Parseval's formula twice, and also we used the absolute convergence 
of the integrals 

OXi OXi 

For the first integral this is obvious. The absolute convergence of the second integral 
can be easily proved. We leave this as an exercise for the reader . 

□ 





Our next goal is to estimate the right side of fl30l) from above. 



Theorem 2.6. For any (f,ip^ , and any positive function w on the plane we 
have 




dxi 



d-ip 



dxi 



dxi dx2 dt < AQ 



heat 
w,2 



\(f\'^w dxi dx2 + 



dxi dx2 

w 



where A is an absolute constant. 



Bellman function 

In the proof we at last use a Bellman function tailored for this problem. It is B 
from the following theorem. The meaning of Q in the next theorem is Q := [w]^^^. 

We use the notation Hf for the Hessian matrix of function / : M'^ — )■ M (the 
matrix of second derivatives of /), and d'^f for the second differential form, which 
is the quadratic form {Hf{x)dx, dx), where (■, ■) is the usual scalar product in R^, x 
is a point in the domain of definition of /, dx is an arbitrary vector in M^. 
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Theorem 2.7. For any Q > 1 define the domain Dq := {0 < {X,Y,x,y,r,s) : 
< Xs, y"^ < Yr, 1 < rs < Q}. Let K be any compact subset of Dq. Then 
there exists a function B = BQ^K{X,Y,x,y,r, s) infinitely differentiable in a small 
neighborhood of K such that 

1)0 < 5 < 5Q(X + r), 

2) -d^B > \dx\\dy\ . 

We prove Theorem 12.71 later. Now we use it to obtain the proof of Theorem 12.61 

Proof. Given a non-constant smooth w that is constant outside some large ball, we 
consider Q = Q^^. We treat only the case w G ^2, that is Q < oo, for otherwise 
there is nothing to prove. Consider two nonnegative functions (f,ip& C^. Now take 
B = Bq xi where a compact K remains to be chosen. 
We are interested in 



b(x,t) := B({Lp'^w){x,t), (^"^w '^){x,t), (.p{x,t),ip(x,t),w{x,t),w ^{x,t)) . 

This is a well defined function, because the choice of Q ensures that the 6-vector 
V, consisting of heat extensions of corresponding functions on M^, 

V := {{(f'^w){x, t), {ijj'^w~'^){x, t), Lp{x, t),^{x, t), w{x, t),w^^{x, t)) 

lies in Dq for any {x,t) G M.'^. Also we can fix any compact subset M of the open 
set M.^ and guarantee that for {x,t) G M, the vector v lies in a compact K. In fact, 
notice that for our w and for compactly supported ip, ip the mapping (x, t) — )■ v{x, t) 
maps compacts in M.^ to compacts in Dq. Now just take K large enough. 
The main object we want to study is 

(31) (^-A]b{x,t). 



dt 

For simplicity we assume that B is already up to the boundary of Dq. The 
technical details what to do without this assumption are left to the audience, see 
We want to estimate the expression in flHT]) 1) from above in average and 2) 
from below in a pointwise way. 

1) Take a "slab" S^^h ■= {ix,t) e Rl : e < t < H}. Notice that for any fixed 
positive t 

Ab{x,t)dx = 0. 



This is because we assumed B to be smooth and because v{x,t) — )■ for a fixed t 
when X — )■ oo rather fast, and the same is true for Vv{x,t). Hence, 

f (— a\ b(x,t) dxdt = f ^b(x,t)dxdt= f b(x,H)dx— f b(x,e)dx. 

JS,,H \^ J JS,,„ dt iR2 V 

Now we recall that b = B o v, that B > (so we can throw away a "minus" term 
above), and that B{X, Y, . . .) < 5Q{X + Y). Then we get (functions below are heat 
extensions of the corresponding symbols on R^): 



b{x, t) dxdt < 
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(32) 5Q / {ip'w{x,H)+i;'w-\x,H))dx = 5Q / [{ip'w){x) + {^lj'w-'){x)]dx . 

2) Now we make a pointwise estimate of (1311) from below. The next calculation 
is simple but it is key to the proof. In it as everywhere 

V = ((v?^w)(x, t), {ip'^w~^){x, t), ip{x, t),ip{x, t),w{x, t),w~^{x, t)) . 



Lemma 2.8. 

d 



(I -a) b{x,t)=^{-d^B) 



dv dv 



dxi dxi 



dv dv 



Proof. 



d 



dv. 



-b = {VB,— 



dt 



dt 



Ab 



dv dv 



dxi dxi 

We just used the chain rule. Now 



dv dv 



dx2 dx2 



dx2 dx2 



+ {VB,Av) 



,dv 



id'B] 



dv dv 



dxi dxi 



(d'B) 



dv dv 



dx2 dx2 



However, the first term is zero because all entries of the vector v are solutions of the 
heat equation. □ 

By Theorem 12.71 



-d^B > \dx\\dy\ . 



Therefore, for (x,t) Lemma [2.81 gives: 



(33) 

Combining 



|-A)6(..t)> 



d(p 



dip 



we get 

dip 



dxi 



dx^ 



+ 



dip 



dxo 



dxi 

dip 



dip 



dxi 



+ 



dip 



dxo 



dip 



dxo 



< 5Q 



dxo 



(p^w +11 ip'^w ^ 



Theorem 12.61 is completely proved by using a Bellman function of our problem 
whose existence is claimed in Theorem 12.71 

□ 



Theorem 12.41 is proved. 



□ 



Remark. The proof of Theorem 12.61 is actually "equivalent" to solution of an 
obstacle problem for a certain fully non-linear PDE. Consider 

[0, 1" 

.1' ■ 

Then we need Hs + cr > in each point in Dq. As we a looking for the best possible 
B satisfying these relationships, it is natural that we should require 

det{HB + (x) = or det(i^B - a) = , 
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these are Monge- Ampere equations. This approach is used in [59], [5l]. Now 
we use another method to prove the existence of Bq required in Theorem 

2.1. More Bellman functions to prove the existence of Bellman function 
Bq from Theorem 12.71 Dyadic shifts. We start with a much simpler "model" 
operator — T^.. The logic will be the following. We want to get a sharp weighted 
estimate of ||To-||l2(^)_^2.2(«,) via the A2 characteristic of w. In the paper of Nazarov, 
Treil, Volberg, see [SU], one can find that the norm | |To-| 1^,2 2,2(t,) is attained on 
some "simple" test functions — and that this holds for every pair u,v. Thus also 
for u = V = w. However, on the family T of test functions one can compute the 
Nu,,2{Tct) ■= sup{||r^t||L2(^) : t eT, \\t\\L2{w) = !}• It turns out that 

Theorem 2.9. N^,2{Ta) ~ Q^J,T- 

J. Wittwer does that in [63] basing her approach on [50]; see also [56]. Thus, we 

get \KhHn.)^LH^) = N^AT.) ~ QtT- 

So let us show what the model operator is, what its sharp weighted estimate is, 
and how one obtains a special function (Bellman function) from this estimate. 

Consider the family of dyadic singular operators To-: 

Here "D is a dyadic lattice on M, hi is a Haar function associated with the dyadic 
interval / {hj is normalized in L^(M, dx)), and a{I) = ±1. We call the family the 
martingale transform. It is a dyadic analog of a Calderon-Zygmund operator. Here 
are important questions about T„, the first one about two- weight estimates and the 
second one about one weight estimates: 

1) What are necessary and sufficient conditions for sup^. \\T„\\l^{u)^l^{v) < 00? 

2) What is the sharp bound on sup^. \\T„\\l'^w)^l'^{w) in terms of w7 How can one 
compute sup^ ||T^||l2(«,)^l2(«,)? 

These questions are dyadic analogs of notoriously difficult questions about "clas- 
sical" Calderon-Zygmund operators like the Hilbert transform, the Riesz transforms 
and the Ahlfors-Beurling transform. The dyadic model is supposed to be easier than 
the continuous one. This turned out to be true. The answers to the questions above 
appeared in |50] , [63] . Moreover these answers are key to answering questions about 
"classical" Calderon-Zygmund operators. 

Strangely enough, the answer to the second question (which seems to be easier, 
because it is about "one weight" ) seems to require the ideas from the "two-weight" 
case. Here is our explanation of this phenomena. The necessary and sufficient 
conditions on {u,v) to answer the first question were given in [50]. They amount 
to the fact that sup^ 1 1 To-| 1^2 2,2(t,) is almost attained on the family of simple test 
functions. This fact has beautiful consequences in the one weight case. For then 
supo- \\Ta\\L'2{w)^L'^(w) is attainable (almost) on the family of simple test functions. 
One may try to compute sup^ ||To- t||L2(^) for every element of this test family, thus 
getting a good estimate for the norm sup^. \\To-\\l'2(w)^l^(w)- Test functions are rather 
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simple, so this program can be carried out. This has been done in Wittwer's paper 
[63] . We will give another proof below. Here is the result. Recall that 

lev 

Theorem 2.10. 

I IT' II ^ A /^dyadic 

sup \\^a\\L2{w)^L2{w) <: ^^w,2 ■ 
a 

Remark. We will postpone the proof of Theorem 12.101 (another use of Bellman 
function technique), here we will use it first to finish the proof of the existence of 
Bq claimed in Theorem 12.61 

So we assume now that Theorem 12.101 is already proved. Let us rewrite Theorem 
12.101 as follows 

sup \^iev(r{I){f,hi){g,hi)\ < Ag5'^''||/||L2(^) ||^||i2(^-i) , 

<7(7) = ±1 

or 

^i&v\{f,hi)\ \{g,hi)\ < Ag;^^f*'||/||i2(^)||^||i2(^-i) . 
This inequality is scaleless, so we write it as 

(35) 

JeV, - (/),J \{g)j_ - {g),J\I\ < AQ%'-{pw)f{g'/w)f. 

Here are the left and the right halves of /, and (■)/ means averaging over 

/ as usual. Given a fixed J & V and a number Q > 1, we wish to introduce the 
Bellman function of (l35l): 



5(X,F,x,?/,r,s) = sup{^S/e2,,/cj|(/)/- - (/)/+! |(^)/_ - : 
= X, {g)j = y, {w)j = r, {w'^)j = s, 
{fw)j = X, {g'Hj = Y,we A'r'^^ Qtaf' < Q} . 

Obviously, the function B does not depend on J, but it does depend on Q. Its 
domain of definition is the following: 

Rq := {0 < {X,Y,x,y,r,s), < Xs, <Yr,l<rs<Q}. 

By (!35|) it satisfies 

(36) 0<B< AQX^I'^Y^I'^ . 

We are going to prove that it also satisfies the following "differential" inequality. 
Denote v := (X, Y, x, y, r, s), f_ = (X_, YL, x_, ?/_, r_, s_), v+ = (X+, F+, x+, y+, r+, s+), 
let V, v^, w_ lie in Rq, and let v = |(i)_ + f+). Then 

(37) B{v)-^{B{v+) + B{v_)) > \x+-x_\\y+-y_\. 
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In fact, let /, w almost maximize B{v) (on the interval J), let /+, w+ do this 
for f-iQ-i do this for B{yJ). The freedom of scale for B allows us to put 

on J4. and f-,g-,w^ on J_. Then we have "gargoyle" functions 






Obviously, {F)j = \{x+ + x_) = {G)j = y, {W)j = r, {W-')j = s, {F^W)j = 
X, {G^W~^)j = Y. These numbers together form the vector v. In other words 
F,G,W compete with f,g,w in the definition (135|) of Bellman function B{v). By 
this definition, 

B{v) > ^S,e7,,/cj|(i^)/- - {F)i,\ \{G)j_ - {G)jJI\ . 

But the almost optimality of f+,g+, w+ on J+ and f-,g-, on J_ gives us (recall 
that F = /-t on J±, G = g± on J±): 

and 

B{v.) < e + ^S,e7,,/cjJ(i^)/- - - (G),J|/| . 

Combining these, we get 

B{v) - \{B{v^) + 5(t;-)) > -2e + - - (G),, 

= -2e+-^\{f^)j_ - - {g+)j^ = -2e + -^\x^ - x+\\y_ - y+\ . 

We are done with 0371) because e is an arbitrary positive number. Therefore, our 
i? is a very concave function. We are going to modify B to have its Hessian satisfy 
the conclusion of Theorem 12. 7[ To do that we fix a compact K in the interior of 
i?Q, and we choose e such that lOOe < dist{K, dRq). Consider the convolution of B 
with ^v^(^), V G M^, where is a bell shape infinitely differentiable function with 
support in the unit ball of M^. It is now very easy to see that this convolution (we 
call it Bk,q) satisfies the following inequahties 

(38) < Bk,q < 6QiX + Y) , 
and for any vector ^ = (^1, ^2, 6, ^4, ^5, ^e) e ]R^ 

(39) -{d'BK,Q^,OR^>2\m3\- 

The factor 2 appears because B{v) — ^{B{v+) + B{v_)) in (!37|) corresponds to 

— ^d'^B, and |x — x+l = — x+l (the same being valid with y's replacing x's and 

— replacing +). 

Theorem 12.71 is completely proved modulo the proof of Theorem 12.101 
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Proof of Theorem \2.1(K To prove Theorem 12.101 we need the following decomposi- 
tion: 

Lemma 2.11. 

(40) hi = aih"^ + , 

where 

1) \o!i\ < \f{w)~i, 

2m < where Ajw := {w)j^ - {w)j^, 

3) {hf}j is an orthonormal basis in L'^{w), 

4) hf assumes on I two constant values, one on /+ and another on J_. 

Proof. To find a, (3 we first apply || ■ ||^2(^) to both parts of (l40l) : {w)i = ||^/||i2(^) = 

+ 13'^ {w) I, and secondly we multiply f HU]) by xi/ ^/W\ and integrate with respect 
to w dx: ^{{w)j_^_ — = /3i{w)j. Clearly Lemma is proved. 

□ 

Now let 

SF := Ci{f, hi) hi , where constants Ci are such that |c/| < 1 . 
I 

Let a := for the rest of the proof. Fix (p G L'^{w),ip G L'^^cr). We need to 
prove 

(41) |(§0^,^a)|<C 



We estimate (S (pw, ipa) as 

\^^ci{(t)w,hi){'4)(T,hi)\ < 
I 

J2\ci{<pw,hr)^/Wiii^^,h'^i)\\^i\ + 
I 

\ci{<Pw)i^{,pa, h^i)^i^I\ + 
\ci{ija)j^{cpw, hr)^/Wi^I\ + 

J {(^)I 

V \ci{4>w)i{^a)j^^^^^n\ =:I + II + III + IV . 
So we have 

/< $^(0^,/^7)\/H7-(^^,/^J)/W7, //< 5^(0ti;,/in/H7-(^^)/^v^, 
III < Y{Mi\^^H^a,h^i)^i, IV < $:(0-)^TT^v^-(^-)/T^ 



BELLMAN FUNCTION TECHNIQUE IN HARMONIC ANALYSIS 19 

The estimate of / is trivial because hf, h" are ortho normal systems in L'^{w), L'^{cr) 
correspondingly: 



(42) / < sup ^{w)j{a)jp^{ct>w, hjf p^{i^a, h^.f < [w]](; 

To estimate the rest let us fix a G (0, 1/2) and introduce 
(43, 

We are going to give a Bellman function proof of the following lemma. 

Lemma 2.12. The sequence {/i/j/eD is a Carleson sequence with Carleson constant 
at most C [wW^. 

We take Lemma 12.121 for granted till the end of the proof of Theorem I2.1UI First 
introduce a notation, let be a positive measure on M, then 



M^^/(x):= sup ^ / l/M/i. 

I(zD,x&I Jl 



This is called dyadic weighted maximal function. We will use it with fi = wdx or 
adx. 

To estimate IV, II, and symmetric to it /// we notice that 
so, choosing p G (1, 2) 

(^)-"/2(a)J""/^nf (M^Vr(x))i/^ • ^, 
where is the dyadic weighted maximal function. Therefore, 

IV < ^(^)^-"(a)l-"inf(M^Vr)'/" ■ infiMMn'^" " f^i ■ 
I 

U< 2_^{(f)w,hj){w)j {a) I -—j-^ 

/ Hi 
The estimate of /// will be totally symmetric, so we omit it. We continue: 

IV < E inf (M^Vr)!/^' ■ infiM^m'/P ■ /i, . 



Choose F = {M^\ij\py/P ■ (M^|0|p)1/p and G = {M^\ij\P)^/P and apply the follow- 
ing simple lemma (Exercise!) 
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Lemma 2.13. Let {aL}L&D define Carleson measure with intensity B. Let F he a 
positive function on the line. Then 



(44) 



V(infF)ai <2B ! F dx . 



(45) y'4^c.L<CB f^dx. 

^ Hl JrW 

Then using Lemma [2.121 we get 

IV < [w]\r[MX [ {MMn'^' ■ (M^m'/^ dx 



wU, / {M^l^lPy/P ■ {M^\(f)\PY/Pw^/^a^/^dx < 



MA.y j mWY'^^dx^ j {Mi\<P\Pflv adx < C[w]a,\w\\^\ 
As to //, we have again using Lemma [2.131 (the second part) and Lemma [2. 121 



Theorem 12.101 is completely proved, function Bq is constructed. 



□ 



We need only to see the validity of Lemma I2.12[ This is done by yet another 
Bellman function. 

Bellman proof of Lemma \2.12[ We prove even a more general statement, namely we 
prove the version in R"' and even in each metric space with geometric doubling 
condition and doubling measure /x. So let us have a metric space with geometric 
doubling condition, meaning that every ball of radius r can fit only at most K 
disjoint balls of radius r/2, K being independent of the ball and its radius. Such 
metric spaces carry a doubling measure /i by a theorem of Konyagin-Volberg [IS], 
and let D denote the family of "dyadic cubes" on this metric space (constructions 
are numerous, the first belongs to M. Christ ^26]), and let Si{I) are dyadic children 
of / e -D. Finally, let I e D and let 

Lemma [2.121 b ecomes the following statement, which we are proving below: 

(46) v/GD ^^J<Ca[w]lAAI). 

JeD,Jci 
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Let Q > 1,0 < a < ^. In domain Qq := {{x,y) : x > 0,y > 0,1 < xy < Q} 
function bQ{x,y) := satisfies the following estimate of its Hessian matrix (of 
its second differential form, actually) 

-d%Q{x, y) > a{l - 2a)x V f ^ + ^ 

The form —d%Q{x, y) > everywhere in x > 0,y > 0. Also obviously < &q(x, y) < 
Q'' in Qq. 

Proof. Direct calculation. □ 

Fix now a cube / and let Si{I),i = 1, M, be all its sons. Let a = ((w)^,/, {<j)^j), 
bi = {{w)f,^s,{i), (c^)/.,s,(/)), « = 1, . . . ,M, be points-obviously-in Qq, where Q tem- 
porarily means [w]a2- Consider Ci(t) = a{l—t) + bit, <t < 1 and qi(t) := hQ^Ciit)). 
We want to use Taylor's formula 

(47) g,,(0) - qi{l) = -q[{0) - I dx I q'^it) dt . 

Jo Jo 

Notice two things: Sub lemma shows that —qi{t) > always. Moreover, it shows 
that if t G [0, 1/2], then we have that the following qualitative estimate holds: 

(48) - qm > c {{w),A-).ir ^ ~ ■ " 



This requires a small explanation. If we are on the segment [a,6j], then the first 
coordinate of such a point cannot be larger than C {w)^j, where C depends only on 
doubling of /i (not w). This is obvious. The same is true for the second coordinate 
with the obvious change of w to a. But there is no such type of estimate from below 
on this segment: the first coordinate cannot be smaller than k (w)^^/, but k may 
(and will) depend on the doubling of w (so ultimately on its [w]a2 norm). In fact, at 
the "right" endpoint of [a,bi] the first coordinate is < fj w dfi/ fj,{si{I)) < 

C fj wdfi/fi{I) = C {w)^j, with C only depending on the doubling of fi. But the 
estimate from below will involve the doubling of w, which we must avoid. But if 
t G [0, 1/2], and we are on the "left half of interval [a,bi] then obviously the first 
coordinate is > \{w)^j and the second coordinate is > \{a)^j. 

We do not need to integrate —q'/it) for all t G [0,1] in fH7|) . We can only use 
integration over [0, 1/2] noticing that —q'l{t) > otherwise. Then the chain rule 

% (i) = {bQ{c^{t)r = {d%Q{cm{h - a), h-a), 

(where (■, ■) means the usual scalar product in M^) immediately gives us (HHj) with 
constant c depending on the doubling of /i but independent of the doubling of w. 

Next step is to add all (H71) . with convex coefficients ^^^|jy^, and to notice that 
Eii = ^hQ{a) Efi -(a - = 0, because by definition 
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Notice that the addition of all ( 147|) . with convex coefficients ^^^(j^^^ gives us now (we 
take into account ( HHl) and positivity of —q'lit)) 



We used here the doubling of /i again, by noticing that ^'■^^jj-'^ > Ci (recall that Sj(J) 
and / are almost balls of comparable radii). We rewrite the previous inequality 
using our definition of A/w, A/cr listed above as follows 

bgia) - /.(..(/)) bQih) > cc, {{w} ( ^-^^ + ^^V^^) " 

Notice that bQ^a) = {w)'^j{a)'^j. Now we iterate the above inequality and get for 
any of dyadic J's: 

This is exactly the Carleson property of the measure {/x/} indicated in our Lemma 
\2.12\ with Carleson constant C Q". The proof showed that C depended only on 
a e (0, 1/2) and on the doubling constant of measure /i. Lemma [2. 121 is completely 
proved. 

□ 

3. Estimates for Ahlfors-Beurling operator. Towards the Big 

IWANIEC PROBLEM BY BELLMAN FOOTSTEPS 

In the previous section we estimated AB operator T in weighted L?'{w). The 
estimate was sharp in [w\a2'- 

(49) |(T/,(7)| <CMAj|/||L2Hlkl|L2(.-i), 
it implied a sharp in [w\ap estimate in weighted Uiw): 

(50) \{Tf,g)\ < Cip) [w]X^^''^')\f\M\9\\L.'i..-m.-^)) , P :=P/(p- 1) • 

But we did not care about C,C{p) at all. Now we consider just w = 1, but we 
care about C{p) very much. Big Iwaniec's problem conjectures 

(51) C{p) = max(p, p/(p — 1)) — 1 =: — 1 . 

This is open at the moment of writing this phrase. Using various Bellman 
functions we will show the row of improvements 

(52) C{p) < 2{p* - 1) . 

(53) C{p)<1.7{p*~l). 

(54) C{p) < 1.575{p* - 1) . 

(55) C{p) < lA{p* - 1) . 
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Recall things that we already know: 1) T = i?f — + 2ii?ii?2, where Ri are Riesz 
transforms = multipliers with symbol + |C2p)^''^5 = 1)2; 



(56) 



2) iRU,g) = -2 



df dg 
dxj dxj 



dxdt . 



where /, g in the left hand side are from C^(]R^), and /, g in the right hand side are 
heat extensions of functions in the left. 

Hence Conjecture [51] is nothing else as the following innocent looking conjecture 



(57) 2 



df ^.df 

+ 1- 



dxi dx2 



^ + i^T^ 1 dxdt 



dxi dx-2 



< (p-1 



l^llp' , P > 2. 



Let complex-valued functions f = u + iv,g = (j) + iip. Consider / := {u,v) as a 
map — )■ M^, do the same with G := {(f), We have Jacobian matrices DF,DG 
then. These are 2x2 matrices. 

Imagine that we want to have a stronger estimate than flS7|) (which is probably 
too much!): 



(58) 2 

This is exactly 
(59) 



df ^.df 
+ 1- 



dxi dx2 



dg dg 
+ 1- 



dxi dx2 



dxdt < (p- l)||/||p||^||p,, p>2. 



2 [[ {\DF\l-2detDFY/WDG\l-2detDGy/^dxdt < (p- l)||/||p||^||p/ , p>2. 
J J»l 

where | ■ I2 is the Hilbert-Schmidt norm of the matrix. 

Nobody can prove fl59|) and equivalent to it fl58|) . They may be wrong! 

However, we will start with proving slightly lighter estimates: 



(60) 



df 



dxi 



dg 



dxi 



+ 



df 



dxi 



dg 



dxi 



dxdt < (p- l)||/||p||^||p', P>2. 



Moreover, we will prove a stronger than fl60|) (but weaker than (1581) ) estimate 
(61) 



df 



dxi 



+ 



df 



dxo 



2\ 1/2 



dg 



dxi 



+ 



dg 



dXn 



2\ 1/2 



dxdt < (p-l)||/||p||^||p/, p>2. 



This will give us f l52|) . f l53|) correspondingly. To get to fl54|) and further improve- 
ments as f l55|) we will need a bit more (stochastic integrals). 

Notice that we already know (by fISB]) ) that fl^U]) immediately proves the following 

Theorem 3.1. \\Rl - Rl\\p < p - 1, p > 2 . 

Because 2R1R2 = U o {R\ — R^) o U^^, where U is an isometry in all spaces 
(in fact, U is the rotation of the argument of function by 45°), we get (!52|) from 
doubling the claim of Theorem 13.11 
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proof of ( 16011 . The first step is by examination of what we aheady had in Section 
12.11 after the statement of Theorem I2.10[ We do now exactly the same: 

Suppose we have the following inequality for functions on interval [0, 1] 
provided with dyadic lattice V: 

(62) J:j^j,\{f,hj)\\{g,hi)\ < (p-l)||/|U.|kllL.',P>2. 

This inequality is scaleless, so we write it as 

(63) 

JeV, -^^^iev,icj\{f)i- - \{9)i- - {9)iM <ip-l) {\fn'J'{\9\'yj'' • 

Here /_, J+ are the left and the right halves of J, and means averaging over / 
as usual. Given a fixed J G p > 2, we wish to introduce the Bellman function of 

Bp{X,Y,x,y) = sup{^S/ei,,7cj|(/)/- - (/)/+! \ {g)i- - : 

{f)j = x, {g)j = y, {\fr)j = X, {\gf)j = Y}. 

Obviously, the function B does not depend on J, but it does depend on p. Its 
domain of definition is the following: 

i?p:={(X,F,x,y), jx^ < X, |yr' < F} . 

By (|62|) it satisfies 



(64) < 5 < (p - 1) x^/py^/p' . 

We are going to prove that it also satisfies the following "differential" inequality. 
Denote v := (X,F,x,y), = (X„, r_, x_, y_), v+ = (X+, r+, x+, y+), let v,v+,V- 
lie in Rp, and let v = |(t;_ + f+). Then 



(65) B{v)--iBiv+) + B{v^)) > -|x+-x_||y+-y_|. 

The proof is verbatim the same as in Section 12.11 And this inequality in infinites- 
imal sense becomes 



(66) (fBp>2\d^\\dy\. 

Having the function Bp satisfying 

1) 0<Bp< {p-l)X^/PY^/P'; 

2) -d^Bp > 2\d^\\dy\. 

Assuming that Bp is sufficiently smooth (which incidentally it is, one can write 
the formula for Bp), we can repeat verbatim we can repeat the proof of Theorem 
12. 6t we start with analyzing ( 



(67) [--A]b{x,t) 
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exactly as in the proof of Theorem 12.61 the only difference that b now is not Bq o v 
but our BpO V and v also slightly different, it is now 

v{x, t) := t), t), f{x, t),g{x, t)) , 

where these are heat extensions of functions on with corresponding symbol. We 
estimate the expression in ( 1671) in a pointwise way from below using 2), and in the 
average on a slab, using 1) we got exactly (160|) . Theorem 13. and, therefore, (152|) . 

Remark. Notice that variables x, y are complex, they are "bench guards" ( "mesto- 
blyustiteli" ) for complex- valued functions f = u + iv,g = (j) + iip. So actually Bp is 
a function of 6 real variables, and, hence, fl66l) should be understood as 



(68) - d^Bp{X;Y;u,v;(f),^) = {HB,h,h) > 2V du^ + dv^ ^/ d(f)'^ + dijj^ 

where u,v,(j),ijj are just real variables (they are "bench guards" for functions with 
the same symbols and their heat extensions), and h = {dX,dY,du,dv,d(j),dilj) is a 
notation (strange may be) for an arbitrary vector in M^. □ 

To obtain (l53l) we notice first that in Theorem 13. II we can use Ri cos^ — i?2 sin^ in 
place of Ri, and Ri sin 6 + R2 cos 6 in place of R2- In fact this is just application of 
rotation on 6 in arguments. Then we notice that (-Ri cos 9 — R2 sin OY — [Ri sin 9 + 
i?2 cosdf = {Rl - Rl) cos 2^ - 2R1R2 sin 2^. Therefore, we got 

Theorem 3.2. For any (p e (0, 27r], \\{Rl - Rl)cos(j)- 2i?ii?2 sin0||p < p - I if 
p>2. 

We notice that a certain estimate of T = {R^ — R2) + 2ii?ii?2 can be obtained if 
we answer the following question. Suppose A, B are two operators in L^^fi), and for 
any angle ||y4cos0 — i?sin0||p < 1, then what is the estimate of \\A — iB\\p7 

This is easy on real functions, let / G L^reaAl^)^ ^^"^ A,_B map real functions 
to real functions {A = R\ — R\^ B = 2R1R2 are such). In fact. 



\f\^dfi> J \{Af){x) cos (t)+{Bf){x) sin (f)\Pdfi 
{\Af\' + \Bf\Y'\cos{a{x)-4>)\^df,{x). 



Integrate this over J^^ . . . , by Fubini' theorem we will get 

(69) I |/rrf/i> I (\Af\' + \Bf\Y'dix-^l^'^\ COS. 

Put 

r2n \ 1/P 



cosd)\^ t 



then on real functions 

(70) \\A + iB\\p < sup WAcoscp + B sin (f)\\p/T{p) . 

Unfortunately this was in real category. We do not know how obtain ( !70|) -or 
something like that-for general operators A, B on complex function. May be this is 
also an exercise? 

However, we will obtain now (1531) . First we need 
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The proof of (16111 . We use the following elementary lemma from Linear Algebra: 

Lemma 3.3 (Linear Algebra lemma). Let A,B,C be nonnegative matrices of size 
d X d. Let 

(71) {Ah, h) > 2{Bh, hf''^{Ch, hf/^ , MheC^. 

Then there exists t G (0, oo) independent of h such that 

{Ah, h) > T{Bh, h) + -{Ch, h), V/i G C^. 

r 

Proof. Exercise. □ 
We apply this lemma separately to hi, h2, where {f = u + iv, g = (p + iip, ) 
hi = (<9xi |/r(a;, t), d^^ \g\^'{x, t), d^^u{x, t), d^^v{x, t), d^^(f){x, t), d^^tp{x, t)) , 
h2 = {d^^\f\P{x, t), (9^2 \g\'''{x, t), d^.,u{x, i), d^.,v{x, t), d^^(t){x, i), d^^ilj{x, t)) , 

and A = HBp{\f\^{x, t), \g\P {x, t), u{x, t),v{x, t), (f){x, t),ip{x, t)), and B consisting of 
all zeros except 3,3 and 4,4 entries, where we have 1, and C consisting of all zeros 
except 5, 5 and 6, 6 entries, where we have 1. 
Then we immediately get (16T|) . 

□ 

The proof of (!53|) . We use the previous notations. We want a better estimate of 
T f = {A + iB){u + iv) = Au — Bv + i{Av + Bu). Using the trick above (1^9]) we can 
average the following equality over (0, 2-7?) 



I {Au — Bv) {x) cos (f) + {Av + Bu) {x) sin (j)\^dfi = 
j {\Au - Bv\'^ + \Av + Swpl^l cos(a(a;) - (i))\^dii{x) . 

Then we get 

t{p)-{[ |r/|P)^/P < sup( / \{Au-Bv){x) cos(P+{Av + Bu){x) sin0|P)i/P 



sup sup [{Au — Bv){x) COS (f) + {Av + Bu){x) sm(f)\ip{x) dx =: E 

tp real?/.,|!?/)||p/<l i 

However the last expression can be rewritten using fl56l) and integration by parts as 
follows: 

E = 2^ [[ {d,,+td,,)f{x,t){d,,+id,,)e-'U{x,t)dxdt < 
J Jul 

2V2 11 {\d^J\' + \d^J\'Y^' {{d,,i;)' + (9^,^)2) < V2{p - 1)11/11,, p > 2 . 
We used ( I6T1) . Here a/2 appeared trivially from 

1(5., +*4J/(x,t)| < V2 {\d,J\' + . 

Finally we get 
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Asymptotically this is 1.41... (p — 1). Choosing large p, interpolating between L^, 
where the norm of T is 1 and the estimate ( 1721) for this large p, then optimizing by 
the choice of p one can get (l53l) (exercise!). 

□ 

Notice that ( 16T1) immediately proves the following 

Theorem 3.4. 1) \\T : Ll^^^ L^W < y/2{j> - 1), p > 2; 

^;i(T/,^?)|<(p- 1)11/11 plls'lip', P > 2, if f,g are real valued. 

Proof. Just look at ([59]), compare it with ^ and the fact that {\DF\l-2 det DFY'^ < 
V2\DF\, F = {u,v). We also need to notice that in this inequality for real valued 
f = u + iO we have det DF = and the constant can be replaced by 1. Finish 
the proof: exercise. 

□ 

So everything above hinges on inequality (I62p . This inequality was proved by 
Burkholder in mid 80's and it is one of the remarkable inventions. It is done by 
use of Bellman function technique. 

3.1. The proof of inequality ( l62l) . Burkholder's Bellman function. We fol- 
low [19], [23], [25] -but loosely. See also the exposition in the review paper [3J. 

Let / be real valued on [0, 1] =: Jq. Let {hj}j^D be the usual Haar functions on 
Iq normalized in L^. Consider an operator 

TJ = ^lU. hi)hj , e := {ej}j , ej = ±l. 

This family is called martingale transforms. 
Burkholder proved the following remarkable 

Theorem 3.5. sup^ llT^Hp = p* — 1 := max(p,p/ (p — 1)) — 1 . 

He gave several proofs, all difficult, to be found in [I2]-[25]. Another proof by 
Vasyunin-Volberg see arxiv: 1006.2633, |59j . 

In all these proofs the following object is indispensable. It is Burkholder's Bell- 
man function. 

Let Q := {{x, y, z) : < z} and let 

B{x,y,z) := sup{||^7||^ : = a;, {g)u, = y, (|/|^)/o = ^,V/ G P \{g,hj)\ = \{f,hj)\}. 
Symmetries: 

(73) B{tx, ty, t^z) = B{x, y, z) , B{-x, y) = B{x, y) , B{x, -y) = B{x, y) . 

Burkholder found the formula for B\ 
Consider for positive x, y 

(yP- {p* - Ifx^ , iiy<{p*- l)x ; 
~\p(l- j,y~^ {y + xy-\y - {p* - l)x) , if y>{p*- l)x . 

Consider the solution of an implicit equation: 

F,{\x\,\y\) = F,{z'/P,B'/%x,y,z)). 
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li p > 2 Burkholder's function is the solution of this equation. If p G (1,2], then 
one considers Fp{\y\, \x\) = Fp{B^^^{x,y, z), z^^^). 
Obviously one gets a 

Theorem 3.6. 5(0, 0, 1) = {p* - 1)p , 

which gives Theorem 13.51 from which we get that (162|) is proved right away. In fact. 

The proof of ( !62l) . We write sup^ \ (Ti;f,g)\ < {p* — 1) ||/||p||5'||p', which follows from 
Theorem 13.51 But this supremum obviously is equal to 

J:iev\{f,hj)\\{g,hj)\. 

Therefore fl62l) is proved. □ 

Remarks. 1) As soon as (162!) is proved we have our Bellman function Bp. 

2) It gives all our inequalities like (16T]) and its consequences like (!53|) . 

3) It is not Burkholder's function. 

4) The existence of our Bellman function Bp follows from the existence of Burkholder's 
Bellman function. These are demographic creatures, they create one another— 
we saw this in previous sections too. 

We are left to prove Theorem 13.51 Instead of finding exact formula for B{x,y,z) 
listed above we will use a certain shortcut (invented already by Burkholder himself). 
Suppose Burkholder's B is finite. 

The shortcut proof of Theorem \3.5l Along with symmetries (173|) it has very good 
concavity properties: 

(74) B{x, y, z) — ^{BiyX + a^y + a, z + (5) + B{x — a,y — a, z — /?)) > , 
if all points lie in Q. Also 

(75) B{x, y, z) — -{B{x + a^y — a, z + (5) + B{x — a,y + a, z — (5)) > , 

if all points lie in Q. 

Inequalities fl7i|) . fl75|) are left as exercise. 
Notice that this means that 

M(a, b, c) := B{a + b,a — b,c) 

is concave in (a, c), and in (6, c). 

Definition. Such M is called bi-concave. 

Definition. Function ip on is called zigzag concave if 

'^{x, y) - ^(V5(a; + a,y + a) + Lp{x - a,y - a) >Q , 

^{^^ y) - ^(V5(a; + a, ?/ - a) + v?(x - a, ?/ + a) > , 
or, which is the same as, 

<^(a;> y) - ^(^(a;^5 y^) + <^(a^", y")) > , if 

= \y+ -y-\^x= ^(x++x"), y = ^(|/+ + ?/"). 
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Theorem 3.7. Put {p{x,y) := sup(^^y;,-^^Q[B{x,y,z) — {p* — l)^z]. It is zigzag con- 
cave. It is the least zigzag concave majorant of h{x,y) := \y\^ — {p* — iy\x\^. 
There is no zigzag concave majorant ip such that il){tx,ty) = t^ipi^^y) of function 
he '■= \y\^ - c \x\P if c < (p* - ly. 

Proof. Put Cp = {p* — ly. Fix {x^,y^) and {x^,y^). Find z'^ which almost gives 
supremum in ip{x^ ,y^) = sup[B {x^ , y^ , z) —Cpz]. Do the same for (p{x^, y^) to find 
z'^ . Then 

B{x~,y~,z~) - CpZ' < Lp{x',y') < B{x',y',z') - CpZ' + e , 

B{x~^, y~^, z~^) — CpZ^ < ip{x~^, y^) < B{x~^, y~^, z^) — CpZ^ + e . 
Let X = ^{x~^ + x^) , y = \{y^ + y^) and put z = ^{z^ + z^). Then 

ip{x, = sup ■ ■ • > B{x, y, z) - CpZ = B{x, y, z) - Cp^{z^ + z') > 

-{B{x ,y ,z )-CpZ ) + -{B{x+,y+,z+)-CpZ+) > -{i^{x ,y ) + ?/+)) -2^ . 
So ip is zigzag concave. Also 

ip{x, y) = sup ■ ■ ■ > lim [B{x, y, z) — Cpz] > \y\^ — Cp\x\^ = h{x, y) . 

z^\x\P+ 

So is a zigzag concave majorant of h. Why the least? Let ip be any zigzag concave 
function such that 

h<i). 

Put \I' := ip{x,y) + CpZ. Then it is easy to see that ^ satisfies dH]), (17^ . Also on 
dVt = {z = we have 

^(x, y, z) > h{x, y) + CpZ = h{x, y) + Cp\xY = \y\^ ■ 

Then combination of the last inequality and the fact that \E' satisfies (I7i|) , ( !75|) gives 
(attention exercise!) 

'^ix,y,z) > B{x,y,z) . 
This a non-trivial exercise. But then trivially for every (x, y) 

'ijj{x,y)= sup [^{x,y,z) - Cpz]> sup [B{x,y, z) - Cpz] = ip{x,y) . 

z:{x,y,z)^Q z:(x,y,z)(^Q 

We need now to prove that hc,c < Cp does not have zigzag concave homogeneous 
majorant. 

This and more is done in 

Lemma 3.8. Function h^, c < Cp does not have zigzag concave homogeneous majo- 
rant. If c = Cp, then the function =: h has such majorant given by 

(\y\P-{p*-iy= h{x,y), zfh<0; 

\p[i-^) {\y\ + \x\y-\\y\ - {p* - i)\x\) , tfh>Q. 

Another zigzag concave majorant of h = (but not the least) is given by 
^x,y) ■.= p(^l- i\y\ + \x\y-\\y\ - {p* - l)\x\) . 

Remark. The fact that ^Q{x,y) < if |x| > \y\ will be crucial for the proof of 
Theorem 13.51 
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The proof of Lemma lS.Si We work in the firs quadrant. Homogeneous can be 
written as 

/ \ / \ / a; y , y — x , 1 + s y 1 — s x 

ip{x,y) = {x+y)(p{— — , — — , s := — — , then — — = —— , — — = —— . 

x + y x + y y + x 2 x + y 2 x + y 



We put g{s) := V^(^, Next we hst some results of computations: 



x+y " x+y 

2 , 2 ")• Next ^^ 

= p{x+yY-'g{s) + {x+yr-'g'{s){l+s) , = p[x+yY-^g{s) + {x+yY-'g' {s){l-s) 
^^^ = p{p-l){x + yY-^g{s)-2{p-l){x + yY~^g\s){l + s) + {x + yY-'g'\s){l + s)\ 
^yy=p{p-lYx + yY-'g{s) + 2{p-l){x + yY-'g\s){l-s) + {x + yY-^g"{s){l-s)\ 

= p{p - + yY-'gis) - 2{p -l){x + yY-^g\s)s - {x + yY-^g'\s){i - s') . 

So on X + y = 1 

= -[(1 - s^)g"is) + 2{p - l)sg'{s) - p{p - l)g{s) , 

^xx + Vyy = 2(1 + s'^)g"{s) - 4(p - l)sg\s) + 2p{p - 1)51(5) . 
So combining the two: 

{d^-yY^P = iPxx - 2v?xs/ + ^yy = 4:g"is) . 

(5x+j/)V = ^xx + 2ip^y + (fiyy = Ag" (s) + Aifi ^y = 4 ( s V ( ^ ) + (p " 1 ) ( "2 s^' ( s ) + ^c/ ( s ) ) ) . 
Zigzag concave means the last two hues have sign < 0. To find ip satisfying these 
two < differential inequalities, let us try first to find it in such a way that the first 
inequahty is equahty! Hence we seek for the hnear g\ Then put 

(76) g{s) = a(^-±^-p^-' 



^2 '2 

Then the second inequahty s^g"{s) + {p — \){—2sg'{s) + pg{s)) < becomes 

(77) 2sg\s)-pg{s)>0, on [-1,1]. 

It is satisfied (as g is hnear) if and only if it is satisfied in —1 and 1. We get 

a(l + p - p) > , -a(l + p + pp)>0. 

As g is greater than {^^Y ~ ^ {^^Y ■> '^^ positive at s = 1, so a > 0. then we get 
from previous inequalities that 

p > max ( p — 1, ) = p* — 1 . 

Let us try linear g with p = p* — 1. So g has zero at Sp such that 

p — 1 = . 

1- Sp 

But if 

h{x, y) = {x + yYH{s) , H{s) := ^ - c/ 



then H has zero at the same point Sp. Now let us find a from the condition 

H'{sp)=g'{sp)^a = p(l-^ 



p^ 
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Where H is concave on [—1, 1]? Inflection point ip is such that 

— j j 

So it is clear that it is always > Sp. As H is concave on [— it is concave on 
[— 1, Sp] (and a little bit on the right of Sp too). 
It is also easy to see that on [—1, Sp] 

(78) s'^H"{s) + (j9 - l){-2sH\s) + pH{s)) < . 

This is an exercise. 

Put now 

_ J our linear g{s) s G [sp, 1] ; 

^^'^^\h{s), se[-i,sp\. 

Then ^Q^x.y) = (x + uYgi,^^, is exactly the same $o as in Lemma [3781 s 
statement. We just checked that it is a zigzag concave majorant of h{x,y). We 
also checked that $(x, y) = (x + vYgi^^, ;5;^), where is our linear function built 
above, is zigzag concave majorant of h{x,y) as well. It is exactly function $ as in 
Lemma \3Ms statement. 



Now let c < Cp. Linear function cannot be higher than corresponding on [—1, 1] 
and satisfy flTTjl . In fact, if as + (3 is higher, then a + (3 > 0. Also fl77j) gives 

2sa -p{a + P)>0^ a{2 - p)s - pf3 > 0,a{p - 2) - pP > . 

Then P < 0,a > 0. So linear function is positive in 1 and negative at zero. So it 
must vanish on [—1, 1], hence it has the form fl76l) . Hence, we we can see that their 
minorant can have only c > Cp. In fact, we remember that p > p* — 1 in fl76l) . 
Then the zero of our linear function must be > Cp. But if our linear function is a 
majorant of with c < Cp it is also a majorant of Hc^. Therefore, its zero must be 
< Cp. This is a contradiction, and a linear solution of two differential inequalities 
will not have minorant with c < Cp. Concave solution will not have such minorants 
either. Exercise. 

Lemma 13.81 is finished. 

□ 

Theorem 13.71 is completely proved. 

□ 

Finishing the proof of Theorem l37R The real case. Now that we have function $ 
($0 will work too) that is 

1) zigzag concave on the plane, 

2) is such that ^{x, y) > h{x, y) := \y\P - {p* - \y\x\f , 

we can do the following. Fix f^g step functions on / := [0,1]. Consider points 
P = {x,y) = {{f)i,{g)i), P+ = (a;+,y+) = ((/)/^, = {x-,y') = 

((/)/-, {9)1-)- Notice that of course P = |(P+ + P"). Also |x+ -x^\ = \y^ -y~\ 
because this differences are "^K/' ^i) "^Ks'; ^-f) correspondingly, and we as- 



sumed in Theorem 13.51 that for every dyadic interval \{f,hj)\ = \{g,hj)\. Let also 
> \y\ (for example both are zeros) 



32 



ALEXANDER VOLBERG 



Then we can use properties of $: 

> > + $(x-,y-)|J_| . 

As intervals I are as good as / we can repeat this for them. Just iterating this 
procedure and denoting by /'^ dyadic intervals of size 2"" with a being any string 
of ± of length n we get 

S^<l>(a;^l/'^)|r| <0. 
Combine this with property 2) above. Then 

But by our construction y'^ = {g)i<y,x'^ = {f)i<y. So we get 

s.i(^?)/.nri < (p* - irs.i(/)/.n/'^i . 

Going to the limit when n — )■ oo we get {\g\^)i < {p* — which gives the 

claim of Theorem 13.51 in the case of real- valued f,g. 

□ 

Finishing the proof of Theorem \3.5[ The complex-valued and Hilbert-valued cases. A 
certain "miracle" happens: $, $o have extra properties of symmetry, not apparent 
at this moment. 

Extra symmetry. Consider (p{x,y) = ^{>y xf + x^, \/yi + 2/2)- ^se standard 
notations, now x, y are vectors, || ■ || is the norm of a vector, dx := {dxi, dx2), dy := 
{dyi,dy2) are also arbitrary vectors. 
We want to see that 



1) _rf2^ _ 



dx 




dx 


dy_ 




dy_ 



) > 0, if \\dx\\ = \\dy\\. This is "zigzag concavity" 



direct analog. 

2) ^{x,y)>h{\\xl\\y\\). 

The second is obvious, but the first happens by a "miracle". Let us prove it and 
see, where the "miracle" happens. 

Calculations (really abusing the language we understand that ^x, are partial 
derivatives of $ with respect to the first and the second variables): 

(fix, = "^X ■ , , , ^X2 = "^x 



X2 



X2 



^x^xi = ^ 



XX 2 I 2 



+ ^x 



Ob n 



{xl + xlfl^ ' 
XIX2 



2 I 2 

Jb — P ^2 



^xxx-i 

Symmetrically for y derivatives. Also 
Therefore, 



'pX2X2 ■ 



X2 ,-v«2 
1" ^x 

xl + xl ^ {xf + xl)^/"^ 



XIX2 



[xl + xi)3/2 ■ 



Xiyj 



■ xy 



\/x\^x\^y\ + 



1/1 



dx 








dy_ 


5 




) - ,r « ( 



X2dX\ + XirfXj 



$2; / X2dX\ — X\dx2 



\x\ 



^y ( y2dy\ - y\dy2 



\V\ 



xy 



X2dx\ + xxdx2\ ( y2dy\ + y\dy2 



\x\ 



\V\ 



+Vyy 



+ 



y2dyi + i/irf?/2 



llz/l 
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= p_\\dx\\^ + pL\\dy\\^ + ^Udx^ 

\\x\\ \\y\\ \\x\\ \\x\\ \\y\\ \\y\\ 

where dx^ dy are projections of vectors dx, dy on direction orthogonal to x, y corre- 
spondingly. 

Recall that up to a positive constant (which we drop now abusing the language) 

y) = {y-{p- l)x){x + yf-^ , if P > 2 , 

and 

{p - l)<l>(x, y) = -{x -{p- l)y){x + yy-^ , if p < 2 . 

Let us consider p > 2, the other case being similar. Looking at the formulae above 
we get by direct calculation with formula for $ that for any numbers h' , k' 



^^^h'^+2^,yh'k'+%yk" = -p[p-l){x+yy~\h'^-k'^)-p{p-l){p-2)x{x+yY-\h'+k'f . 

(By the way we immediately see that this form is < if \k'\ = \h'\, which is infini- 
tesimal version of zigzag concavity.) 

Now let us combine our formulae, putting h' = {dx, n^), k' = {dy, -jylj). Then 



dx 




dx 


dy 


1 


dy_ 



^Mx\? + h\\dy\? - Pip - mM + \\y\\r\h'' - k") 
\\y\\ 



-p{p - 1)(P - 2) ||x|| (||a;|| + \\y\\r-%h' + k')' 
Let us look at the first line of the last formula. Calculate 



\x\ 



.\\dxr + -y-\\dy\\ 



X 



\y 

where Term := pl-dl/?-!!^ 
same is true for k. In particular 



^ ||4||2 + _ /,/2) ^ ^g^^ 

\\x\\ 



x\\ \\y 

^). This is just because + h'"^ 



^, and the 



(79) 
In fact, 

(80) 



Term = if 



and Term < , if 



> 



\x\ 



-pip 



kii + iii/iir'<o. 



Combine three last formulae. Then we have 



dx 




dx 


_dy_ 




dy_ 



+ 



\\dy\\' 



Fii \\y\ 

+Term - p{p - l){p - 2)\\x\\{\\x\\ + \\y\\Y-'\h' + k' f . 
The second line is obviously negative (see f l79|) ). To have the first line negative it is 
necessary and sufficient to have 



\y\ 



< 



Calculate: 



\\y\ 



p{\\y\\ - {p 



\x 



x\ 



Np-2 



Combine this with ( IHOl) to get 



^7 



M + \\y\ 



\p-2 



p{p - 1) - p + p{p - 2) 



X 
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f82) 



pip-2)- 



\x\ 



\y\ 



\p-i 



\y\ 



< , if p > 2 . 



The case p < 2 goes along the same hnes with corresponding change in the formula 
for $. The proof of Theorem 13.51 is finished in the complex-valued case. One can 
notice that the same proof works in any Hilbert space, not just 2-dimensional as 
above, exercise! 

□ 



Theorem is finally completely proved. 

We want to remember a formula that has been just obtained [p > 2): 



dx 




dx 


_dy_ 




dy_ 



□ 



pip-2) ^1'"^" + 1',^"^"'' ||4||2_p(p_l)(||^|| + ||^||)P-2(||^^||2_||^^||2) 



\y\ 



(83) _ _ _ 2)||a;||(||x|| + \\y\\Y~' ( (dx, ^) + (dy, /-r) 

\ W-^W \\y\\ 

where \\dyf = \\dyf - {dy, Also 



y 



p{i-^{ ^ > hr - {p* - irwxr . 



On the other hand, if 1 < p < 2, we know that 

^(x,y) = i\\yr-{p*-l)\\xmx\\ + \\y\\r-' 

will satisfy 



15) 



dx 




dx 


_dy_ 




dy_ 






-P{P 



-p{2-p)- 



kll + \\y\ 



\p-i 



\x\ 



■iMxf-p(p-i)(iix||+ibiir2(iMyf-iirfxf) 



- p{p - 1){2 - p)\\x\\{\\x\\ + \\y\\r-' {dx, --.) + {dy, 



X 



\x\ 



\y\ 



where = ||(ia;|p — (dx, ||fj|)^- The same majorization (J8^ happens for 1 < p < 2 

as well. 



Inequalities fl52|) . f l53|) are completely done. However, to move further, in partic- 
ular to flMj) . f l55|) . we need a new tool=stochastic integrals. 

4. Stochastic Integrals. Ito's formula 

Let w{s) := Ws denote Brownian motion started at 0, that is Wo = 0, and for all 
ti < ^2 < ^3, random variables — Wt^, Wt^ — Wt^ are Gaussian independent with 
zero average and variances \/t2 — ti, i/ta — t2 correspondingly. 

We want to understand what does it mean 

^{t)dwt . 



It is not the Riemann sum defintion as the following example shows. 
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Example. Consider two simplest Riemann sums built on a partition of the interval 
[a,b]: 

m 

El := ^ w{ti_i){w{ti) - w{ti_i)) , 

i=l 

m 

i=l 

If refinement is small, we should have had (if stochastic integral were a Riemann 
sum thing) that these two random variables Ei and E2 are close. Let us see, whether 
this is the case. 

Notice that uniformly (when the partition changes) they are in L^(0, J-", P), where 
(n, J-", P) is the probability space on which Brownian motion is given. 

E(Ei)2 = ^E(w(ti_i)(w(g - w(ti_i))w(t,_i) • {witi) - w(ti_i)) + 

i<j 

^¥.{{w{U-if{w{ti)-w{U^if) = J]E(w(ii_i)(w(g-«;(ii_i))w(i,_i)-E(^/;(ii)-w(ii 

i i<j 

i 

Also 

E(S2)' = 2ES2 + 2E{J2{{w{ti) - w{U.i))y < 

i 

26(6-a) + 2E(^a', 

where := w{ti) — w{ti-i) are Gaussian independent with average zero and af — 
\ti — ti^i\. Then 

E(5^a' = 2$^EC,2EC| + EC,^ = 

i<j 

2j2\ti- ti-i\\tj - tj-i\ + 3 J] |ti - < 5(6 - a)2 . 

i<j i 

The correct definition of integral should have been such that if Ei, E2 are uniformly 
in and arc both the Riemann sums, they should have been close in some sense. 
Suppose they are close (as random variables) in probability (one of the weakest 
sense possible). Then we use a simple exercise that if ||/n||L2(p) < C, =^0, then 

In our case, nothing like that happened: 

EEi = , 

EE2 = EEi + E{J2{w{U) - w{ti_i)f = J2iti - U-i) = 6 - a ^ . 

i i 

We understand now that stochastic integral J^^{t)dw(t) is a much more subtle 
thing than Riemann sum integral. Stochastic integrals were understood by Kioshi 
Ito. 
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4.1. A bit on Ito's definition. . 

Let BJ^ be a sigma algebra of sets A C M x such that for every t G [a, h] we have 
A n ((— C)0, t\ X VL) is in Bt x J-i, where Bt is Borel sigma algebra on (— cxd, t], J^j is a 
sigma algebra generated by {ws}s<t- Let M2[a, 6] is the set of functions measurable 
with respect to BJ^ such that 

(a) fit) is measurable with respect to Ft for each t, 

(b) with probability 1, \f{t)\^dt < oo. 

For all such random functions (random processes) Ito defines 

(86) ff{t)dw{t). 



Definition. / G M2[a,6] is called a step function if there exists a partition such 
that fit) = fiti)iu) ,for t G [tj,tj+i). We introduce the stochastic integral for them 
in a natural way 

fdt := fi^i)i'^) ■ i^iU+i) - w{ti)) . 



I 

J a 



Lemma 4.1. For every f G M2[a, b] there exits a sequence of step functions as above 
such that with probability 1 



hm / |/(t)-/„(t)|2rft = 0. 

J a 



Moreover if in addition 



E I \f{t)\^dt < oo 

J a 



then step functions can be chosen to have 

limE [\f{t)-frM'dt = 0. 

J a 

We need the following Lemma. 
Lemma 4.2. Let (f be a step function as above. Let 6,e > 0. Then 

ifit) dw{t) I >£}<- + P{ / |(^(t) p dt>6}. 

^ J a 

This lemma immediately gives the following reasoning. If-as above-/ G M2[a,b] 
and fn are step functions from Lemma 14. ![ then 

-6 



P- hm / |/(t)-/„(t)|2dt = 0. 

J a 



Then 



By definition 



b 

2 



P- lim / \f^{t)-fn{t)\'dt = 0. 

n,m— >oo 



V£ > , P{ /" |/^(t) - fnit)\^dt >e}^0,m,n-^oo. 

J a 
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Now we use Lemma 14.21 to have 

rb pb 



limsupP{| / fn{t)dw{t) - / f,^{t)dw{t)\>e} < - 

for any 6 > 0. So the sequence of random variables ^„ := J^^ fn{t) dw{t) is Cauchy 
convergent in measure (in probabihty). So in probabihty it converges to a certain 
random variable ^. This ^ is by definition fdw{t). Ito's stochastic integral 
is constructed. 

This integral has many nice properties: 

If E/^*|/(t)|2rft < oo, then Ej^f{t) dw{t) = and 

rb fb 



E{[ f{t)dw{t)f = E I \f{t)\'dt. 

J a J a 



If in addition E \g{t)\^ dt < oo then 

(87) E{f f{t)dw{t)- f g{t)dw{t)) = E f f {t) ■ g{t) dt . 

J a J a J a 

(Integral of the product is the product of integrals.) 

4.2. Stochastic differential. Let h{t) G M2[a, 6], and a[t) be measurable with 
respect to J^t for every t, and 



|a(t)| dt < oo . 

Suppose ({t) is a random process such that for all ti, ^2 such that a < ti < t2 < b 



((^2) - C(ii) = / a{t)dt+ h{t)dw{t). 
Jti Jti 

Then we write the above line as stochastic differential: 

d({t) = a{t)dt + h{t)dw{t) . 

Remark. If a = this integral is a martingale (obviously) on the filtration {J^t}t>o 
of sigma algebras generated by Brownian motions. 

4.3. Ito' formula. Let C, have the stochastic differential in the sense above and let 
u{t,x) be a (several times) smooth function. Consider new process 

Vit) :=M(t,C(t)). 
Theorem 4.3. Then t] also has stochastic differential and 

driit) = [u[{t, C{t) + Kit, at))a{t) + ^<,(t, C{t)) ■ b^t)] dt + Kit, at)) ■ bit) ■ dwit) . 
Proof. The proof is quite subtle. See [15], [62] . 

□ 
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Matrix Ito's formula also exists and will be used. Let a be m x 1 column of 
processes, a is a. m x k matrix of processes (with entries in M2[a,b]). Let W{t) be 
a column of k independent Brownian motion. Let C(t) be a m x 1 process with 
stochastic differential 

d({t) = a{t) dt + adW{t). 

Let u{t,x) be a smooth function, where x G M™. Let T]{t) = u{t,({t))- Then r] also 
has stochastic differential, and matrix Ito's formula gives 

(88) dr]{t) = [du/dt + V^.u(t, C) ■ a{t) + itrace((T/J„(t, Qa*)] dt + V ■ adW{t) . 
Here ■ is the scalar product in W^. 

4.4. Space-time Brownian motion. Let us discuss Theorem 14. 3[ If a = 0, then 
the process C is a martingale (see Remark before the theorem). However, it is quite 
unrealistic to expect that if we consider the composition of a non-linear function u 
and a martingale, then we would get another martingale. And in fact, if a = the 
formula in Theorem 14.31 becomes (if a = 0) 

(89) dvit) = K(t, at) + ^uiit, at)) ■ b\t)] dt + <(t, at)) ■ bit) ■ dwit) , 

and the "non-martingale" part (called drift) in square brackets is very much present. 
But there is one very important exception. 

Suppose / G and u^t.x) is the heat extension of /, in other words, the 
solution of the heat equation: 

Fix large positive T and consider function of (t, x) given by n = m-^(T — t, x). We 
want to compose it with stochastic process as in Theorem 14. 3[ with a = 0,6 = 1. 
Then we get the process 

■q ■= u^(T - t,Wt) . 
It will be a martingale on [0,T]. In fact, we can use fl89l) to get 

du^ 1 d'^u^ du^ 

Mt) = [—g^iT-t,wt) + --^iT-t,wt)]dt + —iT-t,wt)dwt, 

and by flUU]) the drift term in the brackets disappears. 

If we work with heat extension for functions on M.'' the same will be true. Now 
Brownian motion Wt is fc- dimensional (just k independent Brownian motions) and 
u-^ is the solution of heat equation 

(91) (^^_lA^^/ = 0,n^(0,x) = /(x). 

Then we get the following stochastic differential 

(92) du^iT - t, Wt) = V^u^iT - t, Wt) ■ dWt , 
where ■ is the scalar product in M.^. 
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We are interested now in the case of complex valued function / on M^, so = 2. 
Thinking that gradient is always column vector and Wt is 2-dimensional row vector 
it is convenient to rewrite fl92l) as 



(93) 



/(Wr)-w^(r,o)= rdWf 

Jo 



ufiT-t,Wt). 



Definition. The expressions dWt 



u-f {T — t,Wt) will be called heat mar- 



tingales. 

But we will need a bigger class, where heat martingales are supplemented by 
their martingale transforms. The simplest martingale transforms are given by 
expressions 



dWt-A 



(T - t, Wi 



t) , 



where A is a fixed matrix not depending neither on u (elementary event) nor on 
time t. 

Consider a special matrix 

(94) A := 



Then we get 

which is 

(95) 



i{dx + idy 



uf{T-t,Wt) 



2 f dWt 
Jo 



d 

id 



u^{T-t,Wt 



This is quite suggestive. In fact, denoting temporarily the Ahlfors-Beurling trans- 
form Rl — Rl + 2iRiR2 by symbol AB, we recall that ABd = d. The following 
theorem holds. 



Theorem 4.4. 



1) lim E( / dWt 



uf{T-t,Wt)\WT = z) = f{z), 



2) lim E( 

T-s>oo 



dWt 



dx + idy 
i{dx + idy 



uf{T-t,Wt)\WT = z)=AB{f){z) 



Proof. Let us consider a test function g and build a heat martingale X(t), < t < T, 
by formula 1), but with / replaced by g: 



X{t) ■.= g{T,0)+ [ dWs 
Jo 



u\T-s,Ws). 



Let Y{t),0 <t <T, denote the martingale in formula 2): 



Y{t):= I dW.. 

10 



dx + idy 
i{dx + idy) 



uf{T-s,Ws 
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Then by "rule" that the product of stochastic integrals is "the integral of the prod- 
uct", we get (below k{t;x,y) := * ) 

27rTE(Y{T)-X{T)) = 27iT [ [[ Bu^ {T -t; x,y) du\T -t; x,y)k{t; x,y) dxdy dt 

Jo J Jr2 



T 







27rT / // du'^{t;x,y)du^{t;x,y)k{T — t;x,y)dxdydt. 



Notice that 27c T k{T — t; x,y) ^ 1 if T goes to infinity. It is not then difficult to 
see that the last expression becomes very close to 

du^{t; X, y) du^{t; x, y) dxdy dt 







when T goes to infinity. 

Recall formula <^ and formula AB = Rj - + 2iRiR2.{ Number 2 in (|30|) 
should be dropped now as we are working with extensions with respect to ^ — |A 
unlike before formula ( l30l) . where we worked with ^ — A.) Combined they give us 
that the last expression would be equal to {AB{f),g) if the integration would be 
...dt and not ...dt. But as T is large and /, g are nice the "error" goes to zero 
when T goes to infinity. So 

(96) 27rrE(y(T) ■ X{T)) = {AB{f),g) + o(l) . 

On the other hand, X{T) = glWx) by fl93|) with / replaced by g. Therefore, for any 
test function g 

2nTE{Y{T) ■ giWr)) = 2^T [ dfiT{zMY{T) \ Wt = z)g{z) , 
where d^T = 2^^~~ dm2{z) is given by the density distribution of Wt. Now using 



T d/iT(z) 

LU. LliCXU Zj/l J. 

bounded fashion when T — )■ oo we obtain 



the facts that g is a nice test function and that 2t[T-£^^^ — ?■ 1 pointwise and in a 



I E(r(T) I Wt = z)g{z) dm2{z) = 2-kTE{Y{T) ■ c/(Wr)) + o(l) . 
Jc 

Comparing this with ( l96l) we get the formula 

(97) A5(/)(^) = lim E( / dWfAy^^yuf{T-t-Wt)\WT = z). 

Theorem is proved. □ 

Remark. It is very easy to see now that for martingale {V(t)}o<i<T constructed 
above 

P^(/)III.(c,.^.)<^l™,2vrTE|F(T)r 

for any p. It is a sort of averaging operator. Moreover, for martingale {X(t)}o<t<r 
we obviously have limiting equality 
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This is trivial from (]93l) : just raise both part to the power p and take the expectation 
(first conditioning over Wt = z, then integrating with respect to dfiriz)) and use 
again the fact that 2ttT^^^ — )■ 1. 
Now put g = f ■ We see that 

Pi?(/)||,<2M,||/||, 

follows from E|y|^ < MpE|Xp for martingales F, X. Notice that Y is just the 
martingale transform of X with the help of matrix A, whose norm is 2. This 
explains the constant 2 in the above display inequality. This is why we study below 
X, Y and their relationship. 

Remark. The reader can find many interesting examples, references and explana- 
tions in recent review of Banuelos devoted to Burkholder's estimate: [3]. 

4.5. Orthogonal (conformal) martingales. Introducing two martingales on the 
filtration of Brownian motion 

X{t) := [ dWs-V,y{T-s;Ws),0<t<T; 
Jo 

Y{t) := [ dWs- AV^^yU^iT ~ s]Ws) =: Ai.X{t) ,0 <t <T , 
Jo 

and using the previous remark, we get that it might be a fruitful idea to look for a 
sharp martingale transform inequality 

(98) Ep*X(t)||p < MpE||X||p. 

Let f = 4> + iip- Introduce the notations 

Ht = [utiT - s;Ws), utiT-s;Ws)] , 
= [vf{T - s; Ws), vf{T - s; Ws)] . 
K', = [ui{T - s; Ws) - u^{T - s; W^), mJ(T - s; W^) + ut{T - s; W^)] , 
^2 = - - ""tiT - s; Ws), utiT - s- Ws) - u^{T - s; Ws)] , 

we can write complex martingale X = Xi + 1X2, Y = AicX = Yi + iY2 in the form 
(below dWs is a 2-row-vector) 



(99) [Xi(t), X2(t)] = [ dWs\^l\= [ [Hldw] + H^dw^' 

Jo L-"2j Jo 



{K'^dwl + K^dwl 



(100) [Y,it), Y2{t)]= j\ws ^ 

Properties of H, K. Vector processes H and K are related by 

(101) \\Klf + \\K'^f < A{\\H^f + \\H^f) 

for all elementary events u and all times s. 

Relationship fllOip is called differential subordination of martingale Y to mar- 
tingale 2X. 

Theorem 4.5 (Burkholder's theorem). // martingale M is differentially subordi- 
nated to martingale N, then 

nM{t)\\, < {p* - l)E||X(t)||p,/ := max(p,p/p- 1) . 

The constant is sharp. 
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In particular, 

(102) E||v4*X||p < 2{p* - l)E\\X{t)\\p. 

But the constant is not sharp! We followed the probabilistic proof in [7], which 
"randomize" the idea of [58] . There is an analytic proof following [58] more directly, 
see 



More properties of H, K. Vector processes K have extra properties: 
(103) Kl ■K^ = 0, \\Kt\\ = \\K^\\ 



for all elementary events u and all times s. Such martingales are called orthogonal 
or conformal. 

Theorem 4.6 (Banuelos-Janakiraman's theorem). We make the exposition of [5] 
using the notations above. If martingale M is differentially subordinated to martin- 
gale N, and martingale M is conformal and ||M(0)|| < ||A^(0)|| then 



E||M(t)||,< J^^^E||iV(t)||,,p>2. 



In particular. 



(104) E\\A^X\\p < ^/2p{p - l)E\\X{t)\\p ,p>2. 

But the constant is not sharp! 
However, this inequality gives 

\\T\\p<^2pip-l),p>2. 

Interpolation between p = 2 and large p with this estimate, optimization in this 
large p, will give flM]) . 

The proof of Theorem \4.6\ Our main tool will be formula f l83p . Trivial renormaliza- 
tion shows that to prove Theorem 14.61 it is enough to prove that if M, are two 
martingales on the filtration of 2-dimensional Brownian motion and M is differen- 
tially subordinated to ' N , p > 2, and M is conformal then 

(105) E||M(t)||p<(p-l)E||iV(t)||p,p>2. 
Consider such M, A^, and their Hi, H2, Ki, K2. We know that 

(106) IliTf < 

p 

where \\K\\^ := \\Ki\\^ + HiTaf , ll^^f := \\Hif + and 

kii ■ hi + ki2 ■ k22 = . 
This and equality \\Ki\\ = \\K2\\ easily implies 

(107) kn-ki2 + k2i-k22 = 0. 

Let V{M,N) := ||M||p - {p-l)\\N\\P, p > 2, ^{M,N) := p{l - l/p)P-W\M\\ + 

\my~\\m-{p-i)\\N\\). 

We would like to prove that E{V {M (t) , N (t)) < 0. But it has been proved that 
V < (p. So it is enough to prove 

(108) E{ip{M{t),N{t))) <0. 
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^(M(t),iV(t)) = ^(M(0),iV(0))+ / d^iMis),Nis)). 



(109) 



To compute Kdip we use Ito's formula, which of course involves Hessian H^. More 
precisely, dip{s) will involve 









.^1 


5 


.^1 



)+ 



Now we look at formula (!83|) . which gives 

rf¥.=Kl-l/pr'(^+5+C^+^),^:=-Mp-l)(l|M(s)|| + ||iV(s)||)^-2(||i/| 



li^l 



B := -p{p-2){\\M{s)\\ + \\N{s)\\r-^M{s)\\ 



1-1 



+ 



M2k2l - Mik22 

IIMil 



where we need to recall that M = Mi + iM2, Mi, M2 being its real and imaginary 
parts. Part C comes from the last part of formula fl83l) . and, obviously, 

C < 0, 
D = ...dw\ + ...dwi , 

where ... involve functions of k^j,h^j and \/(p{M{s), N(s)). This shows that D(s) 
is a martingale starting at and so 



:iio) 



E / D(s) = 0. 



We open the brackets in B, use (I107p . and the fact that kfi + = /c^g + ^225 to 



get 



Now 



B<-p{p-2){^\\K\\l)i\\M\\ + \\N\\Y-' 



A + B = -p(\\M 



mr'iip - mm 



i-l\\m<0: 



if (I106p is valid. Term C is non-positive. Term D disappears after integration E , 
As a result we come to (see (11091) ): 

E [ d<^{M{s), N{s)) = E(^(M(0), N{0)) = <^{M{0), N{0)) < , 
Jo 

because if x := ||M(0)|| < y := ||A^(0)|| then ip{x,y) < 0, which is obvious from the 
formula for (p. 

□ 

As we already mentioned, this proves (15^ . To prove (l55i) one needs even more 
careful stochastic analysis, and we leave this for the next round of lectures somewhere 
in the future. 
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5. Bellman function of Stochastic Optimal Control problems 

Let Ws be di dimensional Brownian motion. Let x{t) is a d-dimensional process 
given by 

(111) x{t) = X + / b{a{s) , x{s)) ds + / a{a{s), x{s)) dWg , 

Jo Jo 

in other words the process starts at a; G M'^ and satisfies a stochastic differential 
equation 

dx{t) = b{a{t),x{t)) dt + a{a{t), x{t)) dWt , 

where a is a (i2-dimensional control process, we can choose it ourselves, but it must 
be adapted, that is a{s) has to be measurable with respect to sigma algebra J-'s 
generated by Wt,0 < t < s. Also values of the process a are often restricted: 

Matrix function a is smooth and d x (ii-dimensional, and 6 is a smooth column 
function of size d. Everything happens in i7 C M'^ (often =M'^). 

The choice of adapted process a{s) gives us different motions, all started at the 
same initial x G M.'^. 

This is a "broom" of motions, hidden elementary even u gives "one stem of a 
broom" . 



Suppose we are given the profit function f{a,x), meaning that on a trajectory of 
x(t), for the time interval [t,t + At], the profit is f{a(t),x(t)) + o(At). So on the 
whole trajectory we earn 

/■oo 

f{a{t),x{t))dt. 

We are also given the pension-we call it bonus function F-how much one is given 
at the end of the life. We want to choose a control process a = a{s) to maximize 
the average profit: 



(112) f"(x):=E/ f{a{t),x{t))dt + \imsupEF{x{t)). 

Jo t~^oo 

If 6 = and F is convex then one ca write lim instead of limsup. 
The optimal average gain, or 

(113) V := supf "(x) 

a 

is called the Bellman function of stochastic optimal control problem (lllll) . 01121) . 

Usually the analysis consists of 

a) writing Bellman PDE on v; 

b) solving it; 

c) using "verification theorem" , which says that under certain conditions on data 
a, b, F, /, Q, A the classical solution of Bellman PDE is exactly v from (11131) . 
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5.1. Writing Bellman PDE.. This consists of a) Ito's formula, b) Bellman's prin- 
ciple of dynamic programming. 
Using Ito's formula (IHHl) we get 

dv '^^ 

dv{x{s)) = ^— afej(a(s), x{s)) dwl+ 

f-^ dxk ^ 

fc=i j=i 

dv 



— {x{s))bk{a{s), x{s)) ds+ 



k=\ 



d 



2 ^ dxidx^ 
i,j=i 



{x{s))a'^{a{s),x{s)), 



where 



di 



a'^{a,x) := ^aik{a,x)akj{a,x) 

k=l 

is i, J matrix element oi dx d matrix aa* . 

Introduce two linear differential operators with non-constant coefficients: 

d 



Ci{a,x) := y^^bk{a,x)- 

k=l 

d 



d_ 

' dxk 



and 



x) := x) + £2(0:, x) . 

Let us hit our formula for dv{x{t)) above by the expectation E, then the first line 
becomes 0, and we get 

^ ' '"^^ E[C{a{t),x{t))v]{x{t)) . 



E 



dt 



vixit)) 



Or 



(114) Ev{x{t)) =v{x)+E [Ci{a{s),x{s)) + C2ia{s),x{s))]v{x{s))ds . 

Jo 

Now we need the second ingredient to write the Bellman equation: the Bellman 
principle^ dynamic programming principle. It is in this next equality: 

POO 

v{x) = supE[ / f{a{t), x{t)) dt + limsup . . . ] 

a Jo t— >oo 



(115) 



snpE[[ f{a{t),x{t))dt + v{x{t))], Vt > . 

a Jo 



A minute though shows that this reflects the stationarity of Brownian motion and 
the fact that to be perfect one has to be perfect every second. 

Now plug Ev{x{t)) from ffTTil) into ffTTHD . We get 
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I'OO 

= supE[/ f{a{t),x{t)) + C{a{t),x{t))v{x{t))\dt, Vt > 

o Jo 

Divide by t and tend t to zero. We "obtain" Bellman equation: 



(116) 



sup[(£(q;, x)v){x) + /(a, x)] = 



Positivity (usually present) of / and convexity (usually present) of F imply (if 
there is no drift, that is if h{a,x) = 0) obstacle condition: 



(117) v{x) > F{x) , yxen. 

Often it becomes boundary condition: 

(118) v{x) = F{x), yxedn. 

The definition of v in domain (not in the whole M.'^) should be slightly changed. The 
integration of profit function now is not from zero to infinity, but from zero to the 
stopping time of the first hit of dQ by the trajectory x{t). 

See details of obtaining f lll6p in the beautiful book of N. Krylov [17]. 

In applications one is also interested in supersolutions of the Bellman equation 

m ■■ 



;ii9) 



sup[£(q;, x)V{x) + f{a, x)] < 0,x E Q 

v{x) > F{x) , X en . 



Lemma : Let V solves (4) and let v be the Bellman function, then V > v m VL. 

Proof : Equation f lll6p states that —C{a,x)V{x) > f{a,x). Using f lll4p for V and 
then flllDp . one gets 

V{x) =EV{x{t)) -E [ {C{a{s),x{s))V){x{s))ds 



>EF{x{t)) + E [ f{a{s),x{s))ds. 
Jo 



Writing lim of both parts, we get V{x) > v°'{x). It rests to take the supremum over 



the control process a. 

5.2. Special matrices a bring us to Harmonic Analysis. Let us consider a 
very simple matrix a not depending on x: 



(120) 



di = 1 , a{a, x) 



=: a . 



If on the top of that 6 = then operator C just involves Hessian matrix of 
function v. 

1 d"^ 1 

{C{a)v){x) = 7^^ dxdx- ^^^^ " -{H^{x)a,a) . 
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We claim that this is the generic case of Harmonic Analysis problems in M}. Equation 
(11161) becomes 



121) 



sup[|(iJt,(x)a, a) + f{a, x)] = 0,x E fl 

aeA 

v{x) > F{x) , X E Q . 



If 6 7^ then we just add the first order differential operator (called drift): 
snp[^{H^{x)a,a) + Y.k=ibkia, x)-£-v{x) + f{a,x)] = 0,x E^l , 

aeA 

v{x) > F{x) , X E Q . 



(122) 



Harmonic analysis on "becomes" the analysis of the following Bellman equation 
(and this is exactly what we did in the sections above devoted to the analysis of the 
Ahlfors-Beurling operator): 



(123) 



di = 2 , a{a, x) 



au ai2 



-: a . 



Conformal restrictions: Matrix a can have restrictions a E A of the type that 
the first row is orthogonal to the second row and that the norms of the rows are 
equal. The reader can notice that these are Cauchy— Riemann conditions, and 
the corresponding solution of fillip will be a conformal martingale (again if 6 = 0). 
Bellman equation becomes 



, sn-p[^tiace{a* H^{x)a) + Ylt=i^ki(y, x)-^v{x) + f (a, x)] = 0,x E 
(124) aeA ^ 

vix) > Fix) , X G . 



Remarks. 1) This is (exactly as (11221) ) very non-linear (actually an example of 
so-called fully non-linear) equation of the second order. 

2) This equation is much more difficult to analyze than (11221) . On the other hand, 
we can easily notice that conformal restrictions on a makes clear that Hessian of v 
should be replaced by Laplacian of v (or some kind of semi-Laplacian-semi-Hessian) . 



6. Examples showing almost perfect analogy between Stochastic 
Optimal Control and Harmonic Analysis 

6.1. Aoo weights and associated Carleson measures. Buckley's inequality. 

We call a nonnegative function on M an weight (dyadic A^o weight actually) if 



(125) {w)j < Cie^^°^^^' ,^JeV . 

Here 15 is a dyadic lattice on M, ( ■ )j is the averaging over J. We are going to 
illustrate our use of Bellman function technique by a collection of examples, the 
first of which is the result of Buckley that can be found (along with "continuous 
analogs" ) in the paper of Fefferman-Kenig-Pipher [55] • 
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Theorem 6.1. Let w G A^o- Then 



2 



(126) ^^^^'m ^ [ (w) I 1^"^^- 

Where C2 depends only on Ci in fll25p . Here i± are right and left sons of i eV. 

Who moves ? 



Xi,X2 = {w)j , (log to) J 
"1 = (w)sonofj - => \ai\ = ^\{w)j_ - {w)jj . 

Function of profit can be read off (11261) if one notices that|j| ^ ■ ■ ■ is the 

average over the hnes of hfe. Each hne of hfe initiates at I and then proceeds to 
leii^i = +1 orei = —1), then to Ie^e2{^2 = +1 or 62 = —1), etc. 

Thus 4| Yl ■ ■ ■ plays the role of E ■ ■ ■ . This allows us to choose the correct 
profit function 



f{a,x) = ^ 



Bonus function F = here. Bellman equation reads now 



(127) sup 

o=(ai,02) 



{H^a, a) + 







to be solved in 

(128) VL = {{xi,X2):l<Xie-''^ <ci] 
with the obstacle condition 

(129) v{x) > Vx G ^] . 
Compare with (I12ip ! 

6.2. A two- weight inequality. 

'iJ ev {u)j{v)j <i^yi ev 

Who moves ? 

Xi,X2 = {u)j , {v)j. 

As in the previous problem f°'{x) is easy to find : 

/(a, x)) = 4|ai| \a2\ ■ 
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sup [{Hya, a) + 8\ai\ \a2\] = Ov > in fl = {x = {xi,X2) '■ < xi,X2; xiX2 < 1} 

a={ai ,02) 



Compare with fll2ip ! 

6.3. John-Nirenberg inequality : Bellman equation with a drift but with 

f{a,x) = 0. 

VJ G V{\<^ - {<^)j\^)j <6^WI eV 



Who moves ? 



Xi = {if) J , X2 = {\(p - = 



ICJ 



Notice that {t = n): 



xl-E{xl+'\x')=X2-^^^^ 



- \ 2 



,t\2 



But 



x*i -E(x*i+^|x*) =xi 



X ^ ~\~ Ob -j^ 



0. 



On the other hand, 



i+l 



bds 



H Jt 

Thus drift h stands for E(x*'*"^|x*) — (in the case of discrete time). Therefore, 
= ( 2 ) ill our case. Notice that f{a,x) = as there is no |j| in the 
functional. Bellman equation in this case has a form 



sup 

o=(ai,02) . 



-{a va, a) — - — a 
dxo 



0. 



Compare with ffT22D ! 



In other words : 
(130) 



' dx'f dx2 dxidx2 



< , det 



dx'f 8x2 dxidx2 



\ dx\dx2 dx'2 / 



\ dxidx2 dx'2 / 



0. 



in Qs = = {xi,X2),Xi e M, < X2 < 6}. The obstacle condition is 
(131) v{x) > F{x) = in Vis 
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Denote the dyadic Bellman function of a corresponding problem. There are 
many solutions of the above equation in Qs which satisfy the obstacle condition 
> e^^ in Qs and even satisfying the boundary condition = e^^ on X2 = 0. These are 

X2) = g (l^V;!ZMe-i+v^-v^, 6<e<l,q>l. 
i — y/e 

One can compute f 5-the smallest solution of the above equation satisfying the ob- 
stacle condition. 

This is not Bp. In fact, B^ > vs- However, vs is the Bellman function for non- 
dyadic John— Nirenberg inequality!!! The rest is in Vasyunin's lectures and in 
[531. 



6.4. Burkholder-Bellman function. 

yieV\{g)j^-{g)j_\<\{f)r^-{f)j_\ 
^MI eV such that \{g)i\ < |(/)/| 

one has 

M')i<{p-^r{\mi p>'^- 

The constant {p — 1)^ is sharp. This is a famous theorem of Burkholder which he 
proved by constructing the corresponding Bellman function. He found it by solving 
a corresponding Bellman PDE - a complicated one. We would like to show a simple 
"heuristic" method of solution. 



Who moves ? 



xi = {g)j , X2 = {f)j X3 = . 
Our rules say that /(a, x) = 0, KF{x\, Xg, ^3) ~ ^\g\^- Denoting by J^t the cr-algebra 
generated by dyadic subintervals of / of length 2~"|J|,t = 2"^, we can write El^fl^* « 
E|(Ea;i|J-t)|^ = E|x^|^ which gives us the correct bonus function X2, ^3) = 
Notice that A = {a = {ai, 02,0^3) : |ai| < |«2|} now. 

This is because |ai| = l\{g)j+- {g)j_\,\a2\ = - and we are given 

that the first quantity is always majorized by the second one. 

So we have the Bellman equation 

sup {H^a, a) = 

|oi|<|a2| ,03 

in Q = {x : {xi,X2,X3) : \x2\^ < X3}(convex), with obstacle condition 

v{Xi,X2,X3) > \Xi\^ . 

Compare with fll2ip ! 

This example is interesting because we have a non-trivial set of restrictions A for 
"control" a. 

Solutions were given by Burkholder [19] (see also [20]-[25]) and also (a different 
approach using Monge-Ampere equation) can be found in [59]. See also a very 
interesting review 
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An interesting Bellman function built by the use of Monge-Ampere equation can 
be also found in [60], l6T]. 



7. The technique of laminates, Bellman function, and estimates of 

singular integrals from below 

Definition. Laminate on M2X2 is a positive finite measure on symmetric real ma- 
trices M|^2 such that 

(132) f{A)> j f{A + M)dv{M) 

for all rank 1 concave functions /. 

Theorem 7.1. Any laminate on M|^2 can he approximated weakly by the push for- 
ward of Lebesgue measure on the plane by the Hessian of smooth compactly supported 
functions, in other words, for any good F 



F{M) dv{M) 



F{Du) dxdy . 



where Du{x, y) :- 



^yx '^yy 



X 
Y 



are just exactly 



Observation. Laminates supported by diagonal matrices 
exactly the measures on such that {z = {X, Y)) 

(133) /(a)> / f{a + z)du{z) 

Jc 

for all bi-concave (meaning separately concave in X and Y) function /. 

Definition. (J X du, J Y dv) is called baricenter of a laminate v supported on 
diagonal matrices. 



Fix p > 2 and = p + 77, 77 > 0. Put 



So : = 



Pri 



p — rj — 1 



K+1 



- 2 ' " ' K -1 

We are going to construct very interesting laminates supported on 

Y = KX,Y + l,X. 

K 



Fix p > 2, fix small f] > 0, put 



(134) so:=l--,K 
Pv 



1 



Pr,-=P + V: 
Pri 

Pv = 



K 



2K K+l 



So Pv-2' 

We are going to present an interesting laminate with baricenter (1, 1) supported 
by lines 

Lk: Y = KX, LyK-. ^ = J^X- 
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Let / be a bi-concave function and 
(135) fiz) = Oi\zn,z^oo 

Then concavity in horizontal variable gives 



(136) 



Rewrite it as 



t + /i-i(t + /i)^^^ ^ ^ h 1 



(137) /(t + /i,t + /i) < ' f{t,t + h)- i,,^,J ij7it + h),t + h) . 

The concavity in vertical variable gives 

From fll37p . f ll38p we obtain (of course we divide by and next, we will make h 
tend to 0) 



f{t + Kt + h)-f{t,t) ^ 1 



1 + 1 



t-j^{t + h) t-j^t + h 



f{t,t + h)- 



fiht + h),t + h)- , ,\ , , fit, ^t) . 



Make h 0. Then 

^^^^^ ^ ^''^^ ' W^i^ - 'Y^^t K-1 t ■ 

f{t,t) ' 



We recall (HMD and multiply by l/tP". Notice that after that LHS - , 

We integrate from 1 to oo and use f ll35p to forget the term at infinity. Then we 
obtain for any bi-concave function on the plane 

(140) - /(1, 1) < -— - / fi-t, t)—-- —— / fit, -t) 



Introduce i'K,r}- 

K r ,A , dt 

It is a laminate supported by '■ Y = KX. And introduce Vi/K,'q'- 

K 1 ^ dt 

It is a laminate supported hj Li/k '■ Y = j^X. Now (11400 can be rewritten as 



JJd.yK„ = j^J^ <Pit,-t)- 



:i41) /(1,1)> j fidiyK,rj + diyi/K,r,) 
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If all our concavity in getting f ll4ip become linearities then we have equality in flMip . 
So di'K,ri + di'i/K,ri IS a laminate with baricenter (1,1). 
Consider a new laminate, now with baricenter (0,0): 

1,, , N le 

Test it on 

MX, Y) = \X + Y\^, MX, Y) = \X- Y\P . 

Then 

.^42) / ^^dfiK,, _ \mK + 1)^ +{K + iy/KP)r^-' + 1{K - 1) 

Choosing 77 > very small we get 

Notice that we can consider a bit different laminate than f^K,ri, Namely let us push 
forward /ix,r? by the map X — )• X, F — )• —Y. The new measure is called o"x,r?- Then 
(11431) transforms to 

Now we use Theorem 17. 1[ It implies that there exist smooth functions with 
compact support on the plane such that 

(145) , > -fT^ -Cr]. 



K-1 

Notice that K depends on 77 (see (11341) ) but 

Thus from fll45p we get the estimate 

(146) \R\- Rl\^>v-\, 

if p > 2. 

This argument can be applied to some other interesting singular operators. Con- 
stant p* — 1 can be described as the smallest constant c = Cp such that the function 

/ic(X, F) = |F + X|P - (?\Y - X|P 

has a bi-concave majorant. 

Definition. Let us call v'p(X, F) the smallest bi-concave majorant of hc{X, Y) = 
\Y + X\P — cP\Y — X\P for the smallest (as we know) possible c = Cp = p* — 1. 

We will recall a formula for ipp in the next section. 

Now let us consider a different family (it is a perturbation of he): 
h,^r := liiY + Xf + T^{X - r)2)i/2|P _ ^v\Y _ x\P . 
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Here is a result proved in [T8] . 

Theorem 7.2. For sufficiently small universal Tq > 0, any p G (l,oo), and any 
T G [— tqjTo], the smallest c for which there exists a bi-concave majorant of h^^r is 



a 



[r) = ((j9*-l)2 + r2)V2. 



Using the same considerations with laminates as above (especially Theorem I7.ip 
we can prove the following estimate from below for "quantum" linear combination 
of secon order Riesz transforms: 

Theorem 7.3. For sufficiently small t and any small positive e one can find g G 
U'{m2) such that 

- Rl)g\' + r\R\ + R^^'X > (if " 1)^ + rYXh - e ■ 

This gives rise to the following problem: 
Problem. For sufficiently small r 

II ^1 ~ ^2 



tI 



: LP{m2) LP(M^ = {{p* - 1)' + ? 



The answer is affirmative, see [IB]. Notice that for p G (1,2) and large r this is 
no longer true. Somewhere we have a "phase transition" of the sharp constant. It 
is not clear what is the critical t{p). 

7.1. "Explanation" of laminates above via Burkholder's function Lpp(X,Y) 
and its properties. We introduce coordinates {x,y): 

Y = y + x,X = y — X. 

Let 

p* 

In the first and second quadrants of xy, Burkholder's function in these coordinates 
is equal to (here the reader should glance at ( 11341) and make = in it, Sq and k 
below are as in (11341) . but with rj = 0) 

, . , . f7p(z/-(p*-l)k|)(|x|+i/)P-i, if ^>So:=l-^ = i 

Now extend ipp{x^y) to the whole plane by 

^p{x,y) = ipp{-x,-y) . 

Burkholder proved [19j 

Theorem 7.4. Such a function coincides with the smallest majorant of hc{x,y) = 
\y\^ — c^|x|^, c = p* — \ bi-convex in X,Y coordinates. For c G [0,p* — 1) there is 
no such bi-concave majorant of he- 



Observation 2. We use here both coordinates {X,Y) and {x,y). In the cone 
X < Y < KX function ipp is linear along Y = const segments. Similarly, In the 
cone -^X <Y<X function (pp is linear along X = const segments. 
This linearity allows to calculate (we are in {X, Y) now) 

(Pp{t + h,t + h) - ipp{t, t) 
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virtually without any loss if we use the T-shape 4-tuple of points in M^: {{^{t + 
h),t + h), {t,t + h), {t + h,t + h), (t, ^t)) as in Section El 

If we move one of the lines Lk, Li/k then two things may happen: 1) we go outside 
of these linearity cones, and subsequently we get strict inequality for (pp{l, 1), or 2) 
if we do not go outside of linearity cones, but then we loose 

(pp = hp*-i 

equality because by the definition of of ipp (see fll47p ) this equality holds only on 
the boundary of and outside of the union of linearity cones. 

Notice also that on these lines Lx, Li/k (recall that K = ^^^^^ if p > 2) we have 
that 

iPp ^ V-i ~ , > V-i ■ 

For c larger than p* — 1 we can again choose the lines where coincides with its 
bi-convex majorant, but then they will be quite negative there and integration of 
he along a laminate supported on such lines cannot be almost positive as it was the 
case above. 



8. Stochastic Calculus and 1/2 quasiconvexity 



We have a bijection of matrices M = 

-d + i{b + c). Notice that 2 det M 
Recall that Sverak's function is the following "simple" object 



a, b 
c, d 

'kP - 



onto {z,w): z = a + d + i{b — c), w 



\w\ 



S{z, w) :- 



z\ — \w\ 




\z\ + \w\ <1 



1, otherwise 



Function 



ijp{z,w) ■= ((p - 1)1^1 - I^Dd^l + \w\Y-\ p>2, 

can be easily obtained from S using the idea of Iwaniec, see e.g. [9]. The process is 
a certain averaging. Therefore, the fact that S is rank-1 convex implies that ipp is 
also rank-1 convex. 

To solve the Big Iwaniec problem of the previous sections it would be enough that 
any of these functions is quasiconvex at zero matrix. This is an outstanding and 
very difficult problem. 

On the other hand we can formulate two problems which seem to be easier and 
may be readily reachable by Stochastic Calculus methods: 
Problem. Prove that S{z, ^w) is a quasiconvex function at zero matrix. 

At least we feel that the following problem is directly reachable by methods of 
Stochastic Calculus: 

Problem. Prove that ipp{^^ > 2, is a quasiconvex function at zero matrix. 

See interesting results in recent paper [3]. 
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